GENES RESPONSIBLE FOR VERNALIZATION REGULATION IN TEMPERATE 

GRASSES AND USES THEREOF 
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10/412,137, filed April 11, 2003, which is incorporated by reference herein in its 
entirety. 

GOVERNMENT INTEREST 

This invention was made with Government support under Grant (or Contract) 
No. 2000-1678 awarded by the USDA-NRI. The Government has certain rights In the 
Invention. 

FIELD OF THE INVENTION 

This invention Is in the field of plant breeding and plant molecular biology. In 
particular, this invention relates to non-naturally occurring plants with an altered 
response to vernalization or flowering time, and to molecular markers for the natural 
occurring alleles of the vernalization genes. 

BACKGROUND OF THE INVENTION 

Little is known about the molecular regulation of the vernalization response in 
grasses. If the molecular mechanism of the vernalization response was better 
understood, the response could be engineered to alter a plant's response to 
vernalization to improve flowering, growth efficiency and, ultimately, yield. Also, being 
able to control flowering may allow better control over breeding of plants. There Is thus 
a tremendous need to Identify molecular factors Involved with a plant's response to 
vernalization. In addition, there Is a need to Identify promoters involved in the 
vernalization response and factors that regulate such promoters. 
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SUMMARY OF THE INVENTION 

In order to meet these needs, the present invention Is directed to the finding 
that the API promoter controls the vernalization response In wheat. The "API promoter 
sequence" as defined herein refers to any sequence that hybridizes to the nucleic acid 
molecule of SEQ ID NO: 12 (Fig. 9) or the complement thereof under at least low 
stringency, preferably moderate, high or very high stringency conditions, or any 
sequence that includes the critical regulatory recognition sites for vernalization present 
In SEQ ID NO: 12, Including the CCTCGI 1 1 IGG (SEQ ID NO:23) sequence located -162 
to -172 bp upstream from the start codon of the API gene. This 11-bp region will be 
referred hereafter as the "CArG-box". In addition, the "API promoter sequence" may 
also include sequences sharing at least 75%, 80%, 85%, 90%, 95%, or 97% Identity 
with SEQ ID NO: 12. The present invention is thus directed to a recombinant API 
promoter sequence such as those depicted in Figures 9A-B and 11. In particular, the 
present invention is directed to recombinant API promoters and their use In plant 
molecular biology and plant breeding. In a first format, the recombinant API promoter 
sequence with all or a portion of the CArG box may be operably linked to any 
heterologous protein coding sequence and introduced Into a plant to regulate the 
expression of the protein by vernalization. 

In a second format, the API promoter with or without part or all of the CArG box 
may be operably linked to an API protein encoding sequence and Introduced Into a 
plant to modify flowering time or the vernalization requirement in the plant. The API 
protein encoding sequences of the invention include those sequences that hybridize 
under high stringency conditions to a nucleic acid selected from SEQ ID N0:5, SEQ ID 
N0:6, SEQ ID NO: 18 and SEQ ID NO:22. The API protein coding sequences may 
encode API proteins such as those having polypeptide sequences selected from SEQ ID 
NO: 7, 8, 19, 20 and 21. 

The recombinant API promoter sequences of the invention may be cloned into a 
vector. The vector may be introduced into a cell. The cell may be a prokaryotic cell or 
a eukaryotic cell. In a preferred format, the cell is a plant cell. 
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The recombinant API promoter sequences may be introduced into a transgenic 
plant. The transgenic plant may be transgenic wheat, barley, rye, oats, or forage 
grasses. The invention is further directed to seed from the transgenic plants of the 
invention. 

The present invention is further directed to a method for altering a plant's 
response to vernalization or its flowering time. A plant's response to vernalization is 
said to be altered when the requirement of vernalization or the effect of vernalization in 
the acceleration of flowering is modified by the expression of a heterologous protein in 
the plant. The method of the invention includes, as a first step, transforming a plant or 
plant tissue with a genetic construct having a recombinant API promoter sequence 
operably linked to a recombinant heterologous protein sequence. The API promoter 
sequence may lack all or a portion of nucleotides -162 to -172 upstream of the start 
codon of SEQ ID NO: 12. In the method of the invention, the recombinant protein 
sequence may be an API protein-encoding sequence or any other useful heterologous 
protein. The method includes, as a second step, expressing the genetic construct In the 
plant to alter the plant's response to vernalization or its flowering time Independently of 
vernalization. The plant may be selected from wheat, barley, rye, oats and forage 
grasses. 

The present invention is further directed to molecular markers for Vrnl derived 
from a gene selected from the group of genes depicted in Figure 1. 

Another aspect of the present invention is the Vrn2 gene and the ZCCTI protein 
produced from the gene. This gene is a repressor of flowering and Its RNA abundance 
decreases during vernalization (Fig. 14). The gene may be in a vector or transgenically 
expressed In plants. The gene is preferably operably linked to a promoter that may be 
an inducible promoter, a regulated promoter, or a constitutive promoter. The ZCCTI 
coding sequences and flanking regulatory sequences of the Invention Include those 
sequences that hybridize under at least low stringency and preferably moderate, high, 
or ver/ high stringency conditions to a nucleic acid selected from SEQ ID NO:74, 75, 78, 
and 79. In another embodiment of the presenting Invention, the ZCCTI coding 
sequences and flanking sequences also Include those sequences with at least 75% 
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sequence identity and preferably at least 80%, at least 85%, at least 90%, or at least . 
95% sequence identity with a nucleotide sequence selected from SEQ ID NO:74, 75, 
78, and 79. The present invention also includes the protein sequences selected from 
SEQ ID NO:76, 77, and 80 as well as protein sequences with at least 75% sequence 
identity and preferably at least 80%, at least 85%, at least 90%, or at least 95% 
sequence identity with a protein sequence selected from SEQ ID NO:76, 77, and 80. 
The present invention further includes nucleic acid sequences encoding the above 
protein sequences. 

Yet another aspect of the present invention includes ZCCT-related proteins and 
nucleic acids encoding such proteins. The ZCCT-related proteins are proteins with 
structural homology to ZCCTl proteins that have at least one ZCCTl activity including 
the ability to repress expression of API in temperate grasses, the ability to interfere 
with the endogenous ZCCTl activity such as by competitively binding the ZCCTl DNA 
binding site or having the repressor activity of ZCCTl. Nucleic acids encoding ZCCT- 
related proteins may be in a vector or transgenically expressed in plants. Such nucleic 
acids are preferably operably linked to a promoter that may be an inducible promoter, a 
regulated promoter, or a constitutive promoter. The ZCCT-related protein coding 
sequences and flanking regulatory sequences of the invention include those sequences 
that hybridize under at least low stringency and preferably moderate, high, or very high 
stringency conditions to a nucleic acid selected from SEQ ID NO:74, 75, 78, 79, 81, 82, 
84, 85, 87, 88, 90, and 91. In another embodiment of the presenting Invention, the 
ZCCT-related protein coding sequences and flanking sequences also include those 
sequences with at least 75% sequence identity and preferably at least 80%, at least 
85%, at least 90%, or at least 95% sequence identity with a nucleotide sequence 
selected from SEQ ID NO: 74, 75, 78, 79, 81, 82, 84, 85, 87, 88, 90, and 91. The 
present invention also Includes the protein sequences selected from SEQ ID NO:76, 77, 
80, 83, 86, 89, and 92 as well as protein sequences with at least 75% sequence identity 
and preferably at least 80%, at least 85%, at least 90%, or at least 95% sequence 
identity with a protein sequence selected from SEQ ID NO: 76, 77, 80, 83, 86, 89, and 
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92. The present invention further includes nucleic acid sequences encoding the above 
protein sequences. 

In still another aspect, the promoter regions from ZCCTl or ZCCT -related protein 
encoding genes as described above may be operably linked to a heterologous gene. 
Such constructs may be in a vector. The vector may be introduced into a cell. The cell 
may be a prokaryotic cell or a eukaryotic cell. In a preferred format, the cell is a plant 
cell. 

In another aspect, flowering in wheat or other temperate grasses may be 
regulated by stimuli other than vernalization. This may be achieved by replacement of 
the endogenous API gene .with an API gene operably linked to an inducible promoter. 
Thus, expression of the API gene may be induced In response to exposure to a 
particular stimulus such as pathogen exposure, wounding, heat exposure, chemical 
exposure, etc. so that the plant will flower at a controlled time or under certain 
conditions. In addition, controlled flowering may be achieved by addition of a ZCCT- 
related protein coding gene operably linked to an inducible promoter. Then removal of 
the stimulus that Increases expression of the ZCCTl repressor can stimulate flowering 
by derepression of API. In yet another embodiment, the expression of the API gene or 
the ZCCTl gene may be regulated by RNAi or antisense gene operably linked to an 
inducible promoter. 

In yet another aspect of the present Invention, a plant that normally requires 
vernalization, such as winter wheat, may be modified to no longer require vernalization 
in order to flower. Such plants may be generated by a number of methods. In one 
embodiment, the plant may be supplied with an API promoter that is not repressed 
prior to vernalization operably linked to an API gene. In another embodiment, the 
plants endogenous ZCCTl activity may be inhibited. The ZCCTl activity may be 
Inhibited by a wide variety of methods. Examples include repression with RNAi or 
antisense gene expression, knockout of the ZCCTl gene or promoter, overexpression of 
a repression defective ZCCT-related protein that competes with the endogenous ZCCTl 
for the ZCCTl DNA binding site, overexpression of a DNA binding defective ZCCT- 
related protein that competes with the endogenous ZCCTl for associated proteins 
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Involved In repressing API, or replacement of the endogenous ZCCTl protein with a 
defective ZCCTl protein by homologous recombination for example. 

In still another aspect, plants that never flower may be generated for use as 
forage or in situations where flowing is not desired such as golf courses. Such plants 
may be generated by expression of a ZCCT-related protein operably linked to a 
constitutive promoter. In another embodiment, the API activity may be permanently 
repressed by RNAI or antisense gene expression. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 depicts physical maps of the VRNl regions of various plants. A depicts a 
genetic map of the VRNl region on chromosome SA'" of T. monococcum. Genetic 
distances are in cM (6,190 gametes). B-D depict physical maps of the colllnear l//?/Vi 
regions in rice, sorghum, and wheat. Regions indicated in red have been sequenced. 
Double dot lines indicate gaps In the current physical maps. B shows the sequence of 
the colllnear region In rice chromosome 3. C shows 5. b/co/orBPXjs 170F8 (AF503433) 
and 17E12 (AY188330). D shows a T. monococcum physical map. BAC clones order 
from left to right Is: 49116, 115G1, 136F13, 133P9, 116F2, 89E14, 160C18, 491M20, 
32803, 609E6, 393011, 719C13, 454P4, 54K21, 579P2, 601A24, 231A16, 638J12, 
52F19, 242A12, 668L22, 539M19, and 309P20 (bold letters Indicate sequenced BACs). 
Black dots Indicate validation of BAC connections by hybridization. E shows the gene 
structure of two MADS-box genes completely linked to the VRNl gene (AY188331, 
AY188333). Bars represent exons. F shows the sequence comparison of the API 
promoter regions from genotypes carrying the Vrnl and vml alleles, and from two T. 
monococcum accessions with additional deletions (SEQ ID NO: 1-4). Deletions in the 
promoters in SEQ ID 2 and 3 have been found to be completely linked to the Vrnl allele 
for spring growth habit. Linkage between SEQ ID 4 and the Vrnl allele for spring 
growth habit has not been determined yet. Numbers indicate distances from the start 
codon. A putative MADS-box protein-binding site (CArG-box) Is highlighted. 
Figure 2A Identifies plants with critical crossovers flanking VRNl. A: homozygous for 
G1777 (vrnl); B: homozygous for G2528 ( l^mi); H: heterozygous. X: crossover 
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between two markers. F2 Vrnl genotype Inferred from F3 progeny test. The names of 
marker genes are the same as indicated in FIg.l. Rgure 2B depicts Progeny tests for 
plants with critical recombination events as determined by the closest heterozygous 
molecular marker to API. "N" indicates the number of plants In each class. "D" 
indicates the range of heading dates of the unvernalized plants with a particular 
genotype after the heading date of the of control spring parent G2528 used as zero. 
Figure 3 depicts the relationship between wheat proteins API and AGLGl and other 
plant MADS-box proteins in a Neighbor-joining tree. Confidence values on the branches 
are based on 1000 bootstraps. Tm= Triticum monococcum, Hv= Hordeum vulgare, Os= 
Oryza sativa, At= Arabidopsis thaliana. 

Figure 4A shows a Reverse Transcription Polymerase Chain Reaction (RT PCR) 
experiment using T. monococcum G3116 (winter growth habit) and API, AGLGl and 
/4C77/V specific primers. The PCR reactions for the three genes were performed using 
the same cDNA samples. Leaves 1-5): Leaves 1) Before vernalization; 2-4) 2, 4 and 6 
weeks of vernalization; 5) two weeks after vernalized plants were returned to the 
greenhouse; 6) unvernalized apices; 7) 6-weeks vernalized apices; 8) young spikes. 
Rgure 48 shows the API transcription levels in leaves relative to >4C77/V measured by 
quantitative PCR. 1-5) Leaves from plants at the same vernalization stage as samples 1- 
5 in 4A. Units are linearized values using the 2^'^^ method, where CT is the threshold 
cycle. 

Figure 5 shows API transcription in leaves of different age. The numbers on the x- 
axis represent different leaves from the same tiller from the youngest (1) to the oldest 
(5). The numbers on the y-axis represent linearized values using the 2^'"^^ method, 
where CT is the threshold cycle 

Figure 6 shows a restriction map of sorghum SAC 17E12. The horizontal lines Indicate 
the fragment detected by hybridization of the Southern blots of the restriction maps 
with the probe indicated above. 

Figure 7 depicts allelic variation In the diploid wheat API DNA sequences. The bolded 
and underlined nucleotide Indicates the only polymorphism in the coding region. Figure 
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7A depicts the sequence of G2528 {vrni) = DV92 {vrni) (SEQ ID N0:5). Figure 7B 
depicts the sequence of G1777 (m7i) = G3116 (i/mi) (SEQ ID N0:6). 
Figure 8 depicts allelic variation in the wheat API protein sequence. The bolded and 
underlined amino acid indicates a difference in the sequence. Figure 8A depicts the 
sequence of G2528 {vrnl) = DV92 (vrnl) (SEQ ID N0:7). Figure SB depicts the 
sequence of Gl777{vml) = G3116 {vml) (SEQ ID N0:8). 
Figures 9A and 9B depict allelic variation in the API promoter region. G2528: Vml 
allele (SEQ ID N0:9), DV92= G1777= G3116= vrnl allele (SEQ ID NO:10-12); 
riUPUfftwi End of the 401-bp repetitive element; GA host duplication created by the 
Insertion of the repetitive element Higliliglited: differences between Vml and vml 
genotypes. Underlined : 5' UTR based on alignment with ESTs BF429319 and BF484655 
Figure 10 depicts a gel showing the 34 and 48 bp deletions in the API promoter 
region. Lines 1, 3, 4, and 5 (PI355516, PI352473, PI272561, PI573529): cultivated T. 
monococcum accessions with winter growth habit. Line 2 (PI349049): 34-dp deletion, 
spring growth habit. Line 6 (PI355515): 48-bp deletion, spring growth habit. Line 7: 
G3116, vml. Lines 8: G1777, vml. Line 9: G2528, Vml (20-bp deletion). 
Figure 11 shows the sequence alignment of the VRN-1 promoter sequences from 
hexaploid, tetraploid and diploid wheat. Section of the VRN-l promoter amplified with 
genome-specific primers. The recessive vrn-Al (SEQ ID NO: 109), vrn-Bl (SEQ ID NO: 
110) and vrn-Dl (SEQ ID NO: 111) were amplified from genomes-A, -B and -D of the 
common wheat winter Triple Dirk line. Vrn-Ala and Vrn-Alb are two duplicated regions 
of the VRN-Al promoter including the large and short foldback elements respectively. 
This duplication was from the spring Triple Dirk line (SEQ ID NO: 112 and SEQ ID 
NO: 113). This duplication with the inserted foldback elements was found in spring 
cultivar Anza and In approximately half of the spring varieties from Argentina and from 
California. On the top line, only the sequence of the inserted foldback element is 
indicated (the rest of the sequence is identical to vrn-Al). The 9-bp host duplication is 
highlighted in light blue. Bases that are part of a perfect inverted repeat are indicated in 
blue. Conserved bases between the two duplications are underlined. Outside the 
foldback element, the 9-bp host duplication is highlighted in light blue and the CArG- 
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boxl In green, but its putative variable border In red. Vrn-Alc: Marquis (PI 94548, SEQ 
ID NO: 114), Vrn-Ald: T. dicoccoides (accession FA15-3, SEQ ID NO: 115) and Vrn-Ale 
from T. timopheevii, (SEQ ID NO: 116). Alleles from T. monococcum are indicated by 
the Am genome (blue letters). The winter recessive vrn-Aml allele was from accession 
G1777 (SEQ ID NO: 13). The alleles for spring growth habit are Vrn-A'"la: G2528 (SEQ 
ID NO: 14), Vrn-A^'lb: PI349049 (SEQ ID NO: 15), Vrn-A'"lc: PI355515 (SEQ ID NO: 16), 
and Vm-A"'ld: PI 503874 (SEQ ID NO: 17). The transcriptional initiation sites were 
deduced based on the sequence of the wheat EST clone and are indicated in pink 
highlight (vrn-Al, vrn-Bl and vrn-Dl). A variable TC-rich repetitive region of the UTR is 
underlined. Deletions are Indicated by dashes highlighted in yellow. Polymorphic bases 
are indicated in red. 

Figure 12 shows a model of the regulation of flowering initiation by vernalization in 
diploid wheat. 

Figure 13A shows a genetic map of the VRN2 region on chromosome SA"" of T. 
monococcum based on 5,698 gametes. Numbers of crossovers in the critical 
recombinant plants are indicated In boxes. Figure 13B shows a physical map of the 
wheat l//?yV2gene region in T. monococcum and in colinear regions from barley and 
rice. BAC clones indicated in red have been sequenced (438,828-bp, AF459088). BAG 
clones order from left to right are: 374A18, 94E8, 304H18, 258C22, 301G15, 61506, 
650N20, 405L8, 271011, 275P20, 157P20, 455C17, 322L23, 702K8, 32A1, 533H16 
and 324G2 (bold letters indicate sequenced BACs). 

Figure 14A shows RT-PCR from leaves of G3116 2-month old plants unvernalized (U) 
or 5 days at room temperature after 6 weeks of vernalization (V). B-E) Quantitative 
PGR. Figure 14B shows transcript levels in leaves of ZCCTl (red scale) and API (blue 
scale) relative to ACTJN'vn G3116 (averages of 5 plants ± SE): 0: before 4°G; 2w, 4w, 
6w: weeks at 4°C; 2w out: 2 weeks In greenhouse after vernalization. Figure 14G 
shows transcript levels in apexes of ZCCTl, ZCCT2an6 API relative to ACUN 'm G3116 
(averages of 3 pools of apexes from 5 plants each ± SE). U= unvernalized, V= 3-5 days 
at room temperature after 6 weeks of vernalization. Figures 14D and 14E show the 
effect of light on transcript levels of ZCCTl relative to UBIQUTTIN'm leaves from 
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unvernalized G3116 (measured every hour, averages of 10 plants ± SE). In Figure 14D, 
plants were moved to the light after 24 h In the dark. In Figure 14E, plants were moved 
to the dark after 8 h in the light. Units for the Y-axis in the quantitative PCR 
experiments (B, C, D, and E) are linearized values using the method, where Ct is 
the threshold cycle. 

Figure 15A shows a transgenic winter wheat transformed with an RNA interference 
construct for ZCCTl which flowered 23 days earlier than the negative control. Figure 
15A bottom shows RT-PCR with primers for the translated PolyA region from the vector 
used in the RNAi transformation. RNA was extracted from transgenic winter wheat 
Jagger transformed with an RNA interference construct for ZCCTl. The primers were 
designed for the translated PolyA region from the vector used in the RNAi 
transformation. Progeny from the transgenic plant (34 positive and 11 negative) 
showed perfect cosegregation of the presence of the transgene and early flowering. 
Figure 15B shows a transgenic spring wheat transformed with an RNA interference 
construct for API which flowered 23 days eariier than the negative control. 
Figure 15C shows RT-PCR with primers for a transcribed region from the vector used 
In the RNAI transformation. RNA was extracted from transgenic spring wheat Bobwhite 
transformed with an RNA interference construct for API. The primers were designed 
for a transcribed region from the vector used in the RNAI transformation. Progeny from 
the transgenic plant (7 positive and 8 negative) showed perfect cosegregation of the 
presence of the transgene and lat flowering. 

Figure 16 shows the Best Neighbor Joining tree based on CCT domains. Bosstrap 
values >50 based on 1000 replications are indicated in their respective nodes. Roman 
numbers In front of the gene names indicate the CC?-//<re group according to Griffiths et 
al. (2003). 

Figure 17 shows the time course expression of ZCCTl under long day and continuous 
light. Samples were extracted from leaves of unvernalized Triticum monococcum 
G3116 every 4 hours. The first 6 samples were extracted from plants located in the 
greenhouse under long day conditions. After the 2AM sampling plants were transferred 
to a growth chamber under continuous light. Values are averages often plants ±SE. 
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Units are linearized values using the 2 ('"^CT) method, where CT is the threshold cycle. 
No significant differences in ZCCTl linearized values were detected among the different 
collection times under continuous light {P= 0.25) and highly significant differences were 
detected under the long day conditions {P< 0.0001). 

Figure 18A shows a comparison of CCT domains from ZCCT and COIike genes. Light 
blue indicates amino acids mainly conserved in the ZCCT genes and green indicates 
amino acids conserved mainly in the CO-like proteins. The arrowhead indicates the 
location of the EI^S mutation in Arabidopsis co-7 and the R to W mutation In DV92. 
Figure 18B shows a comparison of putative Zinc fingers. Underlined indicates primer 
sequence, light blue indicate putative zinc fingers. 

BRIEF DESCRIPTION OF THE SEQUENCE LISTING 

The following list of sequences are grouped according to the nature of the 
sequence. The list does not include sequences used as PGR primers or sequences used 
in sequence comparisons. 

SEQ ID N0:5 is the protein encoding nucleotide sequence of the API from 
wheat DV92. 

SEQ ID N0:6 Is the protein encoding nucleotide sequence of the API from 
wheat G3116. 

SEQ ID N0:7 is the protein sequence of the API from wheat DV92. 
SEQ ID N0:8 is the protein sequence of the API from wheat G3116. 
SEQ ID N0:9 is the nucleotide sequence of the API promoter region from 
wheat G2528. 

SEQ ID NO: 10 is the nucleotide sequence of the API promoter region from 
wheat DV92. 

SEQ ID N0:11 is the nucleotide sequence of the API promoter region from 
wheat G1777. 

SEQ ID N0:12 is the nucleotide sequence of the API promoter region from 
wheat G3116. 
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SEQ ID NOs: 13-17 are the nucleotide sequences from the API promoter 
regions from T. monococcum including winter recessive G1777 and spring growth 
accessions G2528, PI 349049, PI355515, and PI503874. 

SEQ ID NOs:109-116 are the nucleotide sequences from the API promoter 
regions of the genomes-A, -B, and -D of the winter Triple Diric line, the spring cultivar 
Anza (duplication of promoter regions with SEQ ID 112 and 113), Marquis PI94548, T. 
t/ADOcco/flfeis (Accession FA15-3), and T. timopheevii. 

SEQ ID NO: 18 is the protein encoding nucleotide sequence of the API from 

barley. 

SEQ ID N0:19 is the protein sequence of the API from barley. 

SEQ ID N0:20 is the protein sequence of the API from hexaploid wheat. 

SEQ ID N0:21 is the protein sequence of the API from Lolium temulentum. 

SEQ ID NO: 22 is the protein encoding nucleotide sequence of the API from 
Lolium temulentum. 

SEQ ID NO:23 is the nucleotide sequence of the CArG-box from the API 
promoter. 

SEQ ID N0:74 is the genomic DNA sequence including the promoter region of 
ZCCTl from T. monococcum DV92, 

SEQ ID NO:75 is the predicted cDNA sequence of ZCCTl from T. monococcum 

DV92. 

SEQ ID NO:76 is the protein sequence of a nonfunctional ZCCTl with a R to W 
mutation from T. monococcum DV92. 

SEQ ID NO: 77 is the protein sequence of a functional ZCCTl from T. 
monococcum G3 1 16. 

SEQ ID NO:78 is the genomic DNA sequence including the promoter region of 
ZCCTl from Langdon (tetraplold wheat). 

SEQ ID NO:79 is the predicted cDNA sequence of ZCCTl from Langdon 
(tetraplold wheat). 

SEQ ID NO:80 is the protein sequence ZCCTl from Langdon (tetraplold wheat). 
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SEQ ID N0:81 is the genomic DNA sequence Including tlie promoter region of 
ZCCT2 from T. monococcum DV92. 

SEQ ID N0:82 Is the predicted cDNA sequence of ZCCr2 from T. monococcum 

DM91. 

SEQ ID NO:83 is the protein sequence of ZCCT2 from T. monococcum DV92. 

SEQ ID N0:84 is the genomic DNA sequence including the promoter region of 
ZCCr2 from Langdon (tetraplold wheat). 

SEQ ID NO:85 Is the predicted cDNA sequence of ZCCT2 from Langdon 
(tetraplold wheat). 

SEQ ID NO:86 Is the protein sequence of ZCCT2 from Langdon (tetraplold 
wheat). 

SEQ ID N0:87 Is the genomic DNA sequence Including the promoter region of 
ZCCT-Ha from winter barley (Dairol<kal<u). 

SEQ ID NO:88 is the predicted cDNA sequence of ZCCT-Ha from winter barley 
(Dairokkaku). 

SEQ ID NO:89 is the protein sequence of ZCCT-Ha from winter barley 
(Dairokkaku). 

SEQ ID NO:90 is the genomic DNA sequence Including the promoter region of 
ZCCT-Hb from winter barley (Dairokkaku). 

SEQ ID N0:91 Is the predicted cDNA sequence of ZCCT-Hb from winter barley 
(Dairokkaku). 

SEQ ID NO:92 is the protein sequence of ZCCT-Hb from winter barley 
(Dairokkaku). 

DETAILED DESCRIPTION OF THE INVENTION 

Before explaining at least one embodiment of the invention in detail, it is to be 
understood that the Invention Is not limited in Its application to the details of 
construction and the arrangement of the components set forth In the following 
description. The invention Is capable of other embodiments or of being practiced or 
carried out In various ways. Also, it is to be understood that the phraseology and 



13 



terminology employed herein Is for the purpose of description and should not be 
regarded as limiting. 

Throughout this disclosure, various publications, patents and published patent 
specifications are referenced. The disclosures of these publications, patents and 
published patent specifications are hereby incorporated by reference Into the present 
disclosure to more fully describe the state of the art to which this Invention pertains. 
The practice of the present invention will employ, unless othen^^lse Indicated, 
conventional techniques of plant breeding, immunology, molecular biology, 
microbiology, cell biology and recombinant DNA, which are within the skill of the art. 
See, e.g., Sambrook et al. (1989), Ausubel et al. (1987), Hayward et al. (1993), Coligan 
et al. (1995), MacPherson et al. (1995), Harlow and Lane (1988) and Freshney (1987). 

Unless otherwise noted, technical terms are used according to conventional 
usage. Definitions of common terms in molecular biology may be found In Lewin 
(1994); Kendrew etal. (1994); Meyers (1995); Ausubel etal. (1987); and Sambrook et 
al. (1989). 

In order to facilitate review of the various embodiments of the Invention, the 
following definitions are provided: 

API protein or API polypeptide: An API protein or API polypeptide Is a 
protein encoded by the floral merlstem identity gene APETALAl (API). In Arabidopsis, 
mutations in API result In replacement of a few basal flowers by Inflorescence shoots 
that are not subtended by leaves. An apical flower produced In an apl mutant has an 
Indeterminate structure In which a flower arises within a flower. 

The present invention may be practiced using nucleic acid sequences that encode 
full length API proteins as well as API derived proteins that retain API activity. The 
preferred API proteins are wheat derived. API derived proteins which retain API 
biological activity Include fragments of API, generated either by chemical (e.g. 
enzymatic) digestion or genetic engineering means; chemically functionallzed protein 
molecules obtained starting with the exemplified protein or nucleic acid sequences, and 
protein sequence variants, for example allelic variants and mutational variants, such as 
those produced by in vitro mutagenesis techniques, such as gene shuffling (Stemmer et 
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al., 1994a, 1994b). Thus, the term "API protein" encompasses full-length API proteins, 
as well as such API derived proteins that retain API activity. 

Representative but non-limiting API sequences useful in the invention include the 
wheat API DNA sequences depicted in Figures 7A and 7B and the corresponding 
protein sequences depicted in Figures 8A and 8B. 



Also encompassed within the definition of API sequences include the barley API 
protein (BM5 AJ249144) encoded by the following sequence: 



ATGGGGCGCAGGAAGGTGCAGCrGAAGCGGATCGAGAACAAGATCAACCGCCAGGTCACCTTCTCCAAGCGC 

CGCrCGGGGCTGCTCAAGAAGGCGCACGAGATCTCCGTGCrCTACGACGCCGAGGTCGGCCTCATCATCrTCr 

CCACCAAGGGAAAGCTCTACGAGTTCTCCACCGAGTCATGTATGGACAAAATTCTTGAACGGTATGAGCGCTA 

CTCTTATGC^GAAAAGGnrTCTCGTTTCAAGTGAATCTGAAATTCAGGGAAACTGGTGTCACGAATATAGGAAA 

CTGAAGGCGAAGGTTGAGACAATAC^GAAATGTCAAAAGCATCTCATGGGAGAGGATCTTGAATCTTTGAATC 

TCAAGGAGTTGCAGCAACTGGAGCAGCAGCTGGAAAGCTCACrGAAACATATCAGAGCCAGGAAGAACCAACT 

TATGCACGAATCCATTTGTGAGCTTCAGAAGAAGGAGAGGTCACTGCAGGAGGAGAATAAAGTTCTCCAGAAG 

GAACTTGTGGAGAAGCAGAAGGCCG\GGCGGCGCAGCAAGATCAAACTCAGCCTCAAACCAGCrLI ICI ICI I 

(XreCTTCATGATGAGGGATGCTCCCCCTGTCGCAGATACCAGCAATCACCCAGCGGCGGCAGGCGAGAGGG 

CAGAGGATGTGGCAGTGC^GCCTCAGGTCCCACTCCGGACGGCGCTTCCACTGTGGATGGTGAGCCACATCA 

ACGGCTGA (SEQ ID NO: 18) 



The corresponding barley API protein (Hv BM5 CAB97352.1 AJ249144) sequence is: 

MGRRKVQLKRIENKINRQVTFSKRRSGLLKKAHEISVLYDAEVGLIIFSTKGKLYEFSTESCMDKILERYERYSYAEK 
VLVSSESEIQGNWCHEYRKLKAKVETIQKCQKHLMGEDLESLNLKELQQLEQQLESSLKHIRARKNQLMHESISEL 
QKKERSLQEENKVLQKELVEKQKAQAAQQDQTQPQTSSSSSSFMMRDAPPVADTSNHPAAAGERAEDVAVQPQ 
VPLRTALPLWMVSHING (SEQ ID NO: 19) 



Also encompassed within the definition of API sequences include the hexaploid 
wheat API protein (Ta API BAA33457 MADS) sequence is: 



MGRGKVQLKRIENKINRQVTFSKRRSGLLKKAHEISVLCDAEVGUIFSTKGKLYEFSTESCMDKILERYERYSYAE 
KVLVSSESEIQGNWCHEYRKLKAKVEnQKCQKHLMGEDLESLNLKELQQLEQQLESSLKHIRSRKNQLMHESISE 
LQKKERSLQEENKVLQKELVEKQKAQAAQQDQTQPQTSSSSSSFMMRDAPPAAATSIHPAAAGERAGDAAVQPQ 
APPRTGLPLWMVSHING (SEQ ID NO: 20) 



Included within the definition of API sequences for this invention is the Lolium 
temulentum API protein sequence which is: 

MGRGKVQLKRIENKINRQVTFSKRRSGLLKKAHEISVLCDAEVGLIIFSTKGKLYEFATDSCMDKILERYERYSYAE 
KVUSTESEIQGNWCHEYRKLKAKVEnQRCQKHLMGEDLESLNLKELQQLEQQLESSLKHIRSRKSQLMHESISE 
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LQKKERSLQEENKILQKELIEKQKAHTQQAQLEQTQPQTSSSSSSFMMGEATPATNRSNPPAAASDRAEDATGQP 
PARTVLPPWMVSHLNNG (SEQ ID NO: 21) 

The corresponding Lolium temulentum API DNA sequence (AF035378) encoding the 
protein sequence Is: 

CrcrCI I CI ICCCCACrGGACGCACGCCATGACACCGGCCCCACGGCrCCACCTGCACCCrCGGGACTAGCCG 

TCGCCGTCGCCGTCCGGGCGGGTTGTCGATTAGGGTTTGGTCTGCTCTTCCAGGGAGGGAGGCGAGATGGG 

GCGCGGCAAGGTGCAGCTCAAGCGGATCGAGAACAAGATCAACCGCCAGGTCACCTTCTCCAAGCGCCGCrC 

AGGCCTGCTCAAGAAGGCGCACGAGATCTCCGTGCTCTGCGACGCAGAGGTCGGGCrCATCATCTTCrCCACC 

AAGGGAAAGCTCTACGAGTTCGCCACCGACTCATGTATGGACAAAATTCrTGAGCGGTATGAGCGCTACTCCT 

ATGCAGAGAAAGTGCTCATTTCMCTGAATCTGAAATrCAGGGAAACrGGTGTCATGAATATAGGAAACTGAA 

GGCGAAGGTTGAGACAATACAGAGATGTCAAAAGCATCTAATGGGAGAGGATCTTGAATCATTGAATCrCAAG 

GAGTTGCAGCAACTAGAGCAGCAGCTGGAAAGTTD^CTGAAACATATTAGATCCAGAAAGAGCCAGCTTATGC 

ACGAATCCATATCTGAGCXTCAAAAGAAGGAGAGGTCACTGCAAGAGGAGAATAAAATTCTCCAGAAGGA^ 

CATAGAGAAGOVGAAGGCCCACACGCAGCAAGCGOVGTTGGAGCAAACTCAGCCCCAAACCAGCTCT^ 

TCCTCCTTTATGATGGGGGAAGCTACCCCAGCAACAAATCGCAGTAATCCCCCAGO\GCGGCCA^^^ 

CAGAGGATGCGACGGGGC>\GCCrCCAGCTCGCACGGTGCTTCCACCATGGATGGTGAGTCACCTCAACAATG 

GCTGAAGGGTCCrrCCACTCCATCTAAACGTATTATTCAGrACGrGTAGCGAGCrGCACCGGCCTGTCTTGTG 

GTTGCCTAGCAAGCTGACCCTCCTGCGTGAGCTGACTrCACGTAAGGrAGCAGGTTGCAATGTGTATATTTCA 

CTCTGTTCTGCTCAGTTTCCCTCCTGCGTGAGCTGACTTCACGTAAGAGTTATTTAACn-GrAATACATGT^^ 

GCGTGAGTGAC^TTTTCCTVCTrTCTACGACCCTCTTGGGTACCGTCTGTTTCrGTGAATrAAAC^^ 

ATCAGTATTATGTATATTGTGATTGTTGAAAAAAAAAAAAA (SEQ ID NO: 22) 

The coding region start and stop sites are bold and underlined. 

The maize and Arabidopsis API sequences are also included within the definition of API 
protein and are disclosed in US Patent No. 6,355,863 which Is hereby incorporated by 
reference. 

API Promoter: An API promoter Is a promoter for the APETALAl (API) gene. 
API promoters are generally found 5' to the API protein coding sequence and regulate 
expression of the API gene. API promoter sequences as defined herein include those 
sequences that hybridize under high stringency conditions to the nucleic acid of SEQ ID 
NO: 12 (Figure 9). Such sequences can be synthesized chemically or they can be 
Isolated from plants. API promoters can be spring or winter API promoters, for 
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example, spring wheat or winter wheat API promoters. Representative plants from 
which API promoters can be Isolated Include wheat (spring and winter), barley, rye, 
tritlcale, oat and forage grasses. A spring API promoter sequence as defined herein 
Includes nucleic acids that hybridize to the nucleic acid molecule of SEQ ID NO: 12 or 
the complement thereof under high stringency conditions wherein the API promoter 
sequence lacks all or a portion of nucleotides -162 to -172 upstream of the starting 
ATG, CCrCGTTTTGG (SEQ ID NO:23) or has Similar deletions to those Indicated In SEQ ID 
NO: 14, 15, 16, 17, 112, and 114. An API promoter sequence Is said to lack all or a 
portion of SEQ ID NO:23 If 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 of the nucleotides of SEQ ID 
NO:23 are missing, changed or altered. A spring API promoter sequence also Includes 
any Insertion or deletion In the proximity of the CArG box that alter Its binding with the 
repressor (e.g. SEQ ID NO: 112 and 113). A winter API promoter sequence as defined 
herein includes nucleic acids that hybridize to the nucleic acid of SEQ ID NO: 12 or the 
complement thereof under high stringency conditions and that is transcriptionally up 
regulated by vernalization. Also Included in the definition of API promoter are additional 
natural or synthetic sequences that might not hybridize with SEQ ID NO: 12 but that 
Include the CArG box CCTCGTTTTGG or a related motif that act as a recognition site for 
the vernalization signal. 

Vernalization: Vernalization is the exposure of plants to cold to trigger 
flowering. For example, winter wheats typically require 4 to 8 weeks at 4° C to flower. 

Sequence Identity: The similarity between two nucleic acid sequences, or two 
amino acid sequences Is expressed In terms of sequence identity (or, for proteins, also 
in terms of sequence similarity). Sequence Identity is frequently measured in terms of 
percentage Identity; the higher the percentage, the more similar the two sequences 
are. As described herein, homologs and variants of the API nucleic acid molecules may 
be used In the present invention. Homologs and variants of these nucleic acid 
molecules will possess a relatively high degree of sequence Identity when aligned using 
standard methods. Such homologs and variants will hybridize under high stringency 
conditions to one another. 
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Methods of alignment of sequences for comparison are well known in the art. 
Various programs and alignment algorithms are described in: Smith and Waterman 
(1981); Needleman and Wunsch (1970); Pearson and Lipman (1988); HIggins and 
Sharp (1988); Higgins and Sharp (1989); Corpet etal. (1988); Huang etal. (1992); and 
Pearson etal. (1994). Altschul etal. (1994) presents a detailed consideration of 
sequence alignment methods and homology calculations. 

The NCBI Basic Local Alignment Search Tool (BLAST) (Altschul etal., 1990) is 
available from several sources, including the National Center for Biotechnology 
Information (NCBI, Bethesda, MD) and on the Internet, for use in connection with the 
sequence analysis programs blastp, blastn, blastx, tbiastn and tbiastx. It can be 
accessed at the NCBI Website. A description of how to determine sequence identity 
using this program is available at the NCBI website. 

Homologs of the disclosed protein and nucleic acid sequences are typically 
characterized by possession of at least 40% sequence identity counted over the full 
length alignment with the amino acid sequence of the disclosed sequence using the 
NCBI Blast 2.0, gapped blastp set to default parameters. The adjustable parameters 
are preferably set with the following values: overlap span 1, overlap fraction = 0.125, 
word threshold (T) = 11. The HSP S and HSP S2 parameters are dynamic values and 
are established by the program itself depending upon the composition of the particular 
sequence and composition of the particular database against which the sequence of 
interest is being searched; however, the values may be adjusted to increase sensitivity. 
Proteins with even greater similarity to the reference sequences will show Increasing 
percentage identities when assessed by this method, such as at least about 50%, at 
least about 60%, at least about 70%, at least about 75%, at least about 80%, at least 
about 90% or at least about 95% sequence Identity. 

The alignment may Include the Introduction of gaps in the sequences to be 
aligned. In addition, for sequences which contain either more or fewer ariiino acids 
than the protein encoded by the sequences in the figures, it is understood that in one 
embodiment, the percentage of sequence identity will be determined based on the 
number of identical amino acids In relation to the total number of amino acids. Thus, 
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for example, sequence Identity of sequences shorter than that shown in the figures as 
discussed t)elow, will be determined using the number of amino acids in the longer 
sequence, in one embodiment. In percent identity calculations relative weight is not 
assigned to various manifestations of sequence variation, such as. Insertions, deletions, 
substitutions, etc 

In one embodiment, only identities are scored positively (+1) and all forms of 
sequence variation Including gaps are assigned a value of "0", which obviates the need 
for a weighted scale or parameters as described herein for sequence similarity 
calculations. Percent sequence identity can be calculated, for example, by dividing the 
number of matching Identical residues by the total number of residues of the "shorter" 
sequence In the aligned region and multiplying by 100. The "longer" sequence is the 
one having the most actual residues in the aligned region. 

As will be appreciated by those skilled in the art, the sequences of the present 
invention may contain sequencing errors. That is, there may be incorrect nucleotides, 
frameshifts, unknown nucleotides, or other types of sequencing errors In any of the 
sequences; however, the correct sequences will fall within the homology and stringency 
definitions herein. 

Very High Stringency: Very high stringency conditions refers to hybridization 
to filter-bound DNA in 5 X SSC, 2% sodium dodecyl sulfate (SDS), 100 ug/ml single 
stranded DNA at 55-65 ° C for 8 hours, and washing in 0.1 X SSC and 0.1% SDS at 60- 
65 ° C for thirty minutes. 

High Stringency: High stringency conditions refers to hybridization to filter- 
bound DNA in 5 X SSC, 2% sodium dodecyl sulfate (SDS), 100 ug/ml single stranded 
DNA at 55-65 ° C for 8 hours, and washing in 0.2 X SSC and 0.2% SDS at 60-65 " C for 
thirty minutes. 

Moderate Stringency: Moderate stringency conditions refers to hybridization to 

filter-bound DNA in 5 X SSC, 2% sodium dodecyl sulfate (SDS), 100 ug/ml single 
stranded DNA at 55-65 ° C for 8 hours, and washing In 0.2 X SSC and 0.2% SDS at 50- 
55 ° C for thirty minutes. 
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Low Stringency: Low stringency conditions refers to hybridization to filter- 
bound DNA in 5 X SSC, 2% sodium dodecyl sulfate (SDS), 100 ug/ml single stranded 
DNA at 55-65 °.C for 8 hours, and washing in 2.0 X SSC and 0.2% SDS at 50-55 ° C for 
thirty minutes. 

Vector: A nucleic acid molecule as introduced into a host cell, thereby 
producing a transformed host cell. A vector may Include one or more nucleic acid 
sequences that permit it to replicate in one or more host cells, such as origin(s) of 
replication. A vector may also include one or more selectable marker genes and other 
genetic elements known in the art. 

Transformed: A transformed cell is a cell into which has been introduced a 
nucleic acid molecule by molecular biology techniques. As used herein, the term 
transformation encompasses all techniques by which a nucleic acid molecule might be 
introduced into such a cell, plant or animal cell. Including transfection with viral vectors, 
transformation by Agrobacterium, with plasmid vectors, and introduction of naked DNA 
by electroporation, llpofectlon, and particle gun acceleration and includes transient as 
well as stable transformants. 

Isolated: An "Isolated" biological component (such as a nucleic acid or protein 
or organelle) has been substantially separated or purified away from other biological 
components in the cell or the organism in which the component naturally occurs, I.e., 
other chromosomal and extra-chromosomal DNA and RNA, proteins and organelles. 
Nucleic acids and proteins that have been "isolated" include nucleic acids and proteins 
purified by standard purification methods. The term embraces nucleic acids including 
chemically synthesized nucleic acids and also embraces proteins prepared by 
recombinant expression in vitro or in a host cell and recombinant nucleic acids as 
defined below. As an example, a gene In a large fragment such as a contig Is not 
sufficiently purified away from other biological components to be considered Isolated 
due to the relatively large amount of extra DNA found in the average contig. 

Operably linked: A first nucleic acid sequence is operably linked with a second 
nucleic acid sequence when the first nucleic acid sequence Is placed In a functional 
relationship with the second nucleic acid sequence. For Instance, a promoter Is 
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operably linked to a protein coding sequence if tlie promoter affects tiie transcription or 
expression of tlie protein coding sequence. Generally, operably linked DNA sequences 
are contiguous and, where necessary, join two protein-coding regions in the same 
reading frame. With respect to polypeptides, two polypeptide sequences may be 
operably linked by covalent linkage, such as through peptide bonds or disulfide bonds. 

Recombinant: By "recombinant nucleic acid" herein is meant a nucleic acid 
that has a sequence that is not naturally occurring or has a sequence that is made by 
an artificial combination of two otherwise separated segments of sequence. This 
artificial combination is often accomplished by chemical synthesis or, more commonly, 
by the artificial manipulation of nucleic acids, e.g., by genetic engineering techniques, 
such as by the manipulation of at least one nucleic acid by a restriction enzyme, llgase, 
recombinase, and/or a polymerase. Once introduced into a host cell, a recombinant 
nucleic acid is replicated by the host cell, however, the recombinant nucleic acid once 
replicated in the cell remains a recombinant nucleic acid for purposes of this Invention. 
By "recombinant protein" herein is meant a protein produced by a method employing a 
recombinant nucleic acid. As outlined above "recombinant nucleic acids" and 
"recombinant proteins" also are "Isolated" as described above. A gene in a large 
fragment such as a contig would not be a "recombinant nucleic acid" given that artificial 
combination does not relate to the gene. However, if sequences around or within a 
gene in a contig have been manipulated for purposes relating to that gene (I.e., not 
merely because the gene is near the end of the contig), then such a gene in a contig 
would constitute a "recombinant nucleic acid" due to the relative proximity of the 
recombinant portion of the nucleic acid to the gene in question. 

Non-naturally Occurring Plant: A non-naturaily occurring plant is a plant that 
does not occur in nature without human intervention. Non-naturally occurring plants 
include transgenic plants and plants produced by non transgenic means such as plant 
breeding. 

Transgenic plant: As used herein, this term refers to a plant or tree that 
contains recombinant genetic material not normally found in plants or trees of this type 
and which has been introduced into the plant In question (or into progenitors of the 
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plant) by human manipulation. For the avoidance of doubt, introduction of a nucleic 
acid isolated from a plant or tree bacl< into the plant or tree by human manipulation still 
generates a transgenic plant. Thus, a plant that is grown from a plant cell into which 
recombinant DNA is introduced by transformation is a transgenic plant, as are all 
offspring of that plant that contain the introduced transgene (whether produced 
sexually or asexually). It is understood that the term transgenic plant encompasses the 
entire plant or tree and parts of the plant or tree, for instance grains, seeds, flowers, 
leaves, roots, fruit, pollen, stems etc 

Inducible promoter: As used herein, this term refers to any promoter 
functional in a plant that may provide differential expression levels in response to 
externally supplied stimuli. This includes both promoters that increase expression and 
promoters that decrease expression in response to stimuli or changed external 
conditions. 

External stimuli that may effect transcription by inducible promoters include, 
without limitation, pathogen attack, anaerobic conditions, the presence or absence of 
light, heat or cold stress, osmotic stress, toxic metal stress, steroid responsive 
promoters, and chemically inducible promoters. Examples of inducible promoters are 
the AdhI promoter, which is inducible by hypoxia or cold stress, the Hsp70 promoter, 
which is inducible by heat stress, and the PPDK promoter, which is inducible by light. 
Examples of pathogen-inducible promoters include those from proteins, which are 
induced following Infection by a pathogen; e.g., PR proteins, SAR proteins, beta-1,3- 
glucanase, chltinase, etc. See, for example, Redplfl et al. (1983); Uknes et al. (1992); 
Van Loon (1985); PCT Publication No. WO 99/43819. 

Also of interest are promoters that are expressed locally at or near the site of 
pathogen infection. See, for example, Marlneau, et ai. (1987); Matton, et al. (1987); 
Somssich et al. (1986); Somsslch et al. (1988); Yang (1996). See also, Chen, et al. 
(1996); Zhang and Sing (1994); Warner et al. (1993), and Siebertz et al. (1989), all of 
which are herein incorporated by reference. 

Additionally, inducible promoters include wound inducible promoters. Such 
wound inducible promoters include potato proteinase inhibitor (pin II) gene (Ryan 
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(1990); Duan et al. (1996)); wunl and wun 2, U.S. Pat. No. 5,428,148; winl and win2 
(Stanford et al. (1989)); systemin (McGurl et al. (1992)); WIPl (Rohmeier et al. (1993); 
Eckelkamp et al. (1993)); MPI gene (Cordero et al. (1994)); and the like, herein" 
Incorporated by reference. 

Both heterologous and non-heterologous (i.e. endogenous) promoters can be 
employed to direct expression of the nucleic acids of the present invention. These 
promoters can also be used, for example, in recombinant expression cassettes to drive 
expression of antisense nucleic acids to reduce, increase, or alter concentration and/or 
composition of the proteins of the present invention in a desired tissue. Thus, in some 
embodiments, the nucleic acid construct will comprise a promoter functional in a plant 
cell, such as in wheat, operably linked to a polynucleotide of the present invention. 
Promoters useful In these embodiments include the endogenous promoters driving 
expression of a polypeptide of the present Invention. 

Regulated promoter: As used herein, this term refers to any promoter 
functional in a plant that provides differential expression levels in response to stimuli 
internal to the plant such as developmental signals. This includes both promoters that 
increase expression and promoters that decrease expression in response to stimuli or 
changed external conditions. Many promoters that are regulated promoters are also 
inducible promoters. For example, promoters that are responsive to auxin are both 
because they will change levels of expression in response to developmental changes in 
auxin levels and In response to externally supplied auxin. 

Examples of regulated promoters under developmental control include promoters 
that initiate transcription only, or preferentially, in certain tissues, such as leaves, roots, 
fruit, seeds, or flowers. Exemplary promoters include the anther specific promoter 5126 
(U.S. Pat. Nos. 5,689,049 and 5,689,051), glob-1 promoter, and gamma-zein promoter. 
An exemplary promoter for leaf- and stalk-preferred expression Is MS8-15 (WO 
98/00533). Examples of seed-preferred promoters included, but are not limited to, 27 
kD gamma zein promoter and waxy promoter (Boronat et al. (1986); Reina et al. 
(1990); and Kloesgen et al. (1986)). Promoters that express in the embryo, pericarp, 
and endospemi are disclosed In U.S. applications Ser. No. 60/097,233 filed Aug. 20, 
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1998 and U.S. applications Ser. No. 60/098,230 filed.Aug. 28, 1998 both of which are 
hereby incorporated by reference. The operation of a promoter may also vary 
depending on its location in the genome. Thus, a developmentally regulated promoter 
may become fully or partially constitutive in certain locations. A developmentally 
regulated promoter can also be modified/ if necessary, for weak expression. 

Ortholog: Two nucleotide or amino acid sequences are orthologs of each other 
if they share a common ancestral sequence and diverged when a species carrying that 
ancestral sequence split into two species, sub-species, or cultivars. Orthologous 
sequences are also homologous sequences. Orthologous sequences hybridize to one 
another under high-stringency conditions. The term "polynucleotide", "oligonucleotide", 
or "nucleic acid" refers to a polymeric form of nucleotides of any length, either 
deoxyrlbonucleotides or ribonucleotides, or analogs thereof. The terms "polynucleotide" 
and "nucleotide" as used herein are used interchangeably. Polynucleotides may have 
any three-dimensional structure, and may perform any function, known or unknown. 
The following are non-limiting examples of polynucleotides: a gene or gene fragment, 
exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, ribozymes, 
cDNA, recombinant polynucleotides, branched polynucleotides, plasmids, vectors, 
isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid probes, and 
primers. A polynucleotide may comprise modified nucleotides, such as methylated 
nucleotides and nucleotide analogs. If present, modifications to the nucleotide 
structure may be Imparted before or after assembly of the polymer. The sequence of 
nucleotides may be interrupted by non-nucleotide components. A polynucleotide may 
be further modified after polymerization, such as by conjugation with a labeling 
component. A "fragment" or "segment" of a nucleic acid is a small piece of that nucleic 
acid. 

ZCCTl protein or ZCCTl polypeptide: A ZCCTl protein or ZCCTl 

polypeptide is a protein encoded by the VRN2qene in temperate grasses. In wheat, 
deletions or mutations in ZCCTl result in a shift from a winter wheat phenotype, which 
requires vernalization, to a spring wheat phenotype, which does not need vernalization 
in order to flower. 
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The present invention may be practiced using nucleic acid sequences tiiat encode 
full length ZCCTl proteins as well as ZCCTl derived proteins that retain ZCCTl activity. 
The ZCCTl activity depends upon the intended use of the ZCCTl derived proteins. For 
example, ZCCTl activity may be full activity in its ability to repress expression of API in 
temperate grasses. ZCCTl activity may also be the ability of the ZCCTl derived protein 
to interfere with the endogenous ZCCTl activity, which could include ZCCTl DNA 
binding activity without repression so that the ZCCTl derived protein competes with the 
endogenous ZCCTl for its DNA binding site. By competing with the endogenous 
ZCCTl, the fragment could be used to prevent the endogenous ZCCTl protein from 
repressing the API gene and any other gene that the ZCCTl protein may repress. Also, 
ZCCTl activity could be the repressor activity without the DNA binding activity. 
Overexpression of the repressor will interfere with the endogenous ZCCTl activity by 
competing with accessory proteins that bind to ZCCTl and enable repression of API. 
This again would prevent the endogenous ZCCTl protein from repressing the API gene 
and any other genes that ZCCTl represses. ZCCTl derived proteins which retain ZCCTl 
biological activity include fragments of ZCCTl, generated either by chemical (e.g. 
enzymatic) digestion or genetic engineering means; chemically functlonalized protein 
molecules obtained starting with the exemplified protein or nucleic acid sequences, and 
protein sequence variants, for example allelic variants and mutational variants, such as 
those produced by In vitro mutagenesis techniques, such as gene shuffling (Stemmer, 
1994a, 1994b). Thus, the term "ZCCTl protein" encompasses full-length ZCCTl 
proteins, as well as such ZCCTl derived proteins that retain the desired ZCCTl activity. 

Representative but non-limiting ZCCTl sequences useful in the invention include 
the wheat ZCCTl DNA sequences and the corresponding protein sequences. 

Examples of such sequences include the genomic ZCCTl DNA sequence from T. 
monococcum DV92 encoded by the following sequence (Exon 1 and 2 are in bold with 
the stop codon underlined): 

GAAAGCGGTGCCCACCCACGCAACCATCGCCAACATCGACTCCGGCCGAACACAAGCTTTTGCCTAGACCTAGACAC 

AACCGCTCTACCAAACCCTCGGAATAGTCTGGCCTAGCTAACAACCAACATCGCCCCTCCCTACTCCGACCAAACAT 

GATGAAGTACTAACTGACAATGCCTCCAAGGAGTAGAATGGCATCCTCAAGCGTCATCATCGCCGAAGAATCAAACT 

TTCGCCCAGAGCCTTAOVCCTCTCaVCa^TCa^CAGACCCCAAGGAAGGAGCaUVGCTAa^ACGAACy^TGTGAG^^ 

AGCCGCGACTCCACTCGATCCAACAACAACC5AACCAGTCCAGTTGGATCAGAAGGACACCACGACCACCTGTATCAC 
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CATGGTAGGCTACCTGTGTCGATGCCAACACCAACCCAACACTACAACCAGGATGCCTGGCAAATGCAGTCCTTGCC 
GTCGAGGAGAACGCGGGCCCAAAATGCCCCCCTAGCCACAAATTGCCCACTATGATAAAAAAACACAAGCCAGAGCC 
GCTGGTAAGTAGTCGCCTTGTGTCGTGGGCCACATAGGCTCACATGCACCGACGGCGGCACCTAAGCCGCCATGAGC 
CACCACCGCGCACAAGAGCAGCAGACACCTAGACCAGGCTGACATGAGACCTGTCACCACTACATGTTGTCGCATCA 
CAACAGGCCATCAGACATCAGTAGGTCGCCGCACCTCAGAAGCACCTCCGTCCCGCAGCCAACACGAAGCACCCAAT 
TCGAGATCCTTCAGCAAACTCCCCTACGTGCGAGTGGCCGCGAGCCGACGAGAAGAGTAGGCCACGCCACCTCCGGA 
TCGGCGCGGGCTTCACTAGGCAGAGCCCATTGGCAATAGCGAGGGGAGGGGGGGCGGTCTAAGAGGGATTGAGCCGC 
CGCCCGGGTGGACCCGGGGAGGGAACAATATTGCAAGTTTTTTTTACAATGAGCACACACCCAGTCTCTGCATAGTT 
AGGATGATATAGCCAACACCAACTCACACACATACGAAAACACGCTGACAACTAGCAACGTCACATAAGACCAATGA 
TGTTCGTTGGCAAGAAAAAAATGCCGCAAAACAATCAGATTTGTGATTGACAAACTACAATAATGACCATATCCACA 
CCAATCATCTCATAAAACCACACTGACGATGAGGTCTTCGATAGCAATGCCTTTAGGAAGGGAGCGACACTCAAGCA 
CCACCATCACTAGATCCAACCACAAGGCCAAAATGTAGGTTTTCATCCCGAAGAATCAGTCCAAGCATATTCGAGCA 
ATGCATTCGACAAGGTAAGAATGTAAGAAAAACATCGCCTTTTCCAGGTATAAACTGTTGGTTCTGACCTAGGCTTT 
TGCCCCTGAGGTCGAGACCGGGTGCTCGAGTAGCAACACCATCGAAGTCGCTCATGTGTTGTCATCACCACTTTTCC 
GTGATTCTAGCAGCAACATGTGATGCAACTGCTGCCACTGCACACCCATCCCTTTGCATCAAGCCGTCGTCCATAAT 
TTGTATCTCATCACTGAAGTTAACCACTGGATCAGGAGAGATGACCCCTCCCAGGGACCATTCAATGACCACTGCCG 
TCGTGGAGTCCTAGGAAGTAGTTGCAGTATAGTTTGCAGCAACACCATCTGGCAGGTCAGATCTGGATCACCACCAC 
CAACCCATGGATCTTAGCGCCGCCAACCAGCCACCGCACACGTGGAAAGCCAGCACCTCGCAGCAACAGTCCTCATC 
CAATAGAAAQACCGACGCCAAATCCGATCGGATCTTGCCGGAGTCCACCCACATAGTGAGTCACCACCACAATAAGG 
GACGCGCCTCTAGCCATGGTAGCGAGTGACCCGCACAACCAGTCTCACCTGCCGTGAAGAAATATGCGTGCTCCCGG 
TTGCTGTAGGATGGGACATATGGTTCCCATGACAAGGGAGCGGTAGAGGGACGAGAGGATGTGAATATAAATATATT 
TTTTTGAAAAGGGGGATATCCCCCAGCGTGTGCATCCAAAAGATGCATGTGACCATATTATTAAAGCAGTTGCAGCA 
AAAAACAAGGTCTGGTATCCAAATATATCTCGCAAAAGGAGCAAAAATAAGTAAAAGCTAAGAAAGATAAACATAGC 
CACAACCCGCAGGATGAGAGGATTAGAATATGCCAAACGAAAAAAAGATGGGATGCGGAAGCATCCTGCCGCCGCCG 
TCCGGCCACCGGTGGTTTCCTCTCGCAACAGCGAGGGGTTGGGTGGCTGGTGGGGGGAGGGTTGGGTGGGGAGGGTT 
GGGTGTGGCAGCGTGGGCACCTCCGGAGTCGACAACGGCGATCCAGGAGGCCGGTTAAGCTTGGGGGAGAAGAGTCC 
TCGCTGGTTCTCATTCTAGAGTTTAGTTTCCATGCCCATGATAATAGCATGGATGCCCCATGACGAAAATTGTTTCA 
CAGCTGGTAGTACTTTTCTATTTTAGTATTGGCATGGTTTCCATTTTGTTGTTTTTGTCTCCCTCGGACTTTTGTGT 
TAGCATCTCCTTTTTGTTTTGACGCTGACCAAAAAAAGCTACACAAATATCTAGCAGTGGCCTTGTGTGGACATAAG 
ATCATGTGGGGGATTCCCGGCAAGCAAGGTCTGCATGGCTCCGGCTCCTCCGCGTAAGAAAGAAAGAAATCAACGAT 
GGATCGAGGGATCATATCTATTCCGACCCACTCATTAGTTGGGTCTATTTGATTTGATCTATCATATTTTGATAGTT 
GCCATATCGAATCTTTTTTCTGGCCTGAGAGCTCACGGCTGCCTATATGCAGTGCATGTGAGAGAGACACAGTACGG 
CCCTAGCTACTACTACAAGTACCTTGGTAGTTACTGGTACTCATAACTGCCTCTTCTTCTTCCTCGACGTCTCTCCT 
CCTCGGCTCCTCCACGCACCAGACCACACCAGAAAAAACAAACAAGCAAGCAAACCTTGGAGCTAGCTAGCAGTATG 
TCCATGTCATGCGGTTTGTGCGGCGCCAACAACTGCCCGCGCCTCATGGTCTCGCCCATTCACCATCATCATCACCA 
TCATCAGGAGCACCAGCTGTGTGAGTACCAGTTCTTC6CCCATGGCAACCACCACCACCACCACCATGGCTCGGCAG 
CAGACTACCCAGTGCCACCGCCGCCAGACAACTTCGACCACCGCAGAACATG6ACCAGACCATTTCATGAAACAGCA 
GCGGCAGGGAACAGCAGCAGGCTCACGCTGGAGGTGGGCGCAGGCGGCCAACACATGGCTCACCTAGTGCAGCCACC 
6GCAAGAGCCCACATCGTAAGTAGTACTACTGCTTAATTGTTTCATCTCTTGCCGATGGATGCGTCCATGGCTTCCT 
CCTTAAAAATCCCCACCTAATTAATGTCCATCTGACTACACCCACTACAAAAAAGTAGCACCATGTAACCATTTCAT 
ATATTTCTCACATAATTCTGTTAATTTACGCTGCTCGATTGTTCTCCTGAA/UU^GATATACGGGAATGGATCTGGAT 
ATTCTTTAATTTTCTATGGAGGCATAGAGTTTGTGTTTTGTATTAGTTGATGCAGAATTGTATGGGTTGTCAAATCA 
TCAGTCATACATATATACTTATTTCTTTTTTTTTGACCAACAAGAAGGTAATCAGTCATACATGCATACTGAAAATT 
AGACTTGTGTGCAATAACTAACTAACCAACTCGACCGGCACAGCTGGGGGAAGACTTTAATCAAGCTGCTAGCTAGA 
GCTTAATAATATAACATATCTCTTTATGGGATCAAGCAATACATATGCGCTCAATTCTCAACTTGTCAATATCTATC 
TGGAGTCCACACTTTATGGTAATTAATTGACAAAGTTTTGTGAAATGGACAATATACATACTGGATCGATGCACCCT 
TTTTCTCATTTTATGTGGTCATTATAATTGATTGTTATTTAGTATTTCAATTTTATCTTGAGCTAGTTTTGCAAGTC 
TGTAGCTCATATATAACTGATACTACTCCCCACGATAGCTTGCGTAGTGGCCGGGTGATCGATCTACCGAGTTCATA 
AAACTGATCGAGATCGGGTCCAAAAAAGAACAAACCCATACAAAATGGAAAGAAGATCCTTGTTTAGTTAGTTTGCA 
TCAGAAAATTGCCTAATTAGTTACTTGCTATCAATCTTTTGAACATGGCATGTTCACCCCAAACGGACCCAGATCAC 
AATTATTGATGAAGTTACGCCTTTTAAAAACTCATAAAACTGTACATACATGTACAGGGCTACACACATGTACATAA 
TACACCTAATTAAAACGTATATTTGTAGACCAATTGATTTTGGACGGTGCGCATCTTTGGAAAAAAAATGCCAGAGG 
AGTTGTTAGCTTCCACTGTCCAGAAATAGAATAGTTACAATCAAGTGCATCTCTGAATGAAAATGGATCATTTTCTA 
GTTAATTAGAGACCAATTAGATACTTCATAAACAGGGGAGTATCAAGTACGTATCTGCTACCCATAAGAAAGTACAT 
AACTGCGATCTTATGATTATTTTCCTCTTGATGTTCAGGTGCCATTTCACGGAGGTGCATTCACCAACACTATTAGC 
AATGAAGCAATCATGACTATTGACACAGAGATGATGGT66GGCCTGCCCATTATCCCACAATGCAGGAGAGAGCAGC 
GAAGGT6AT6A6GTATAGGGAGAAGA6GAAGAGGCGGCGCTATGACAA6CAAATCCGATAC6AGTCCAGAAAAGCTT 
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ACGCTGAGCTTCGGCCATGGGTCAACGGCCGCTTTGTCAAGGTACCCGAAGCCATGGCATCGCCATCATCTCCAGCT 
TCGCCCTATGATCCTAGTAAACTTCACCTCGGATGGTTCCGG TAAT TTATAGCACAAGCCAGATAAAATGATAACAT 

ATTTCCTTCTGATTGATCCACCCGTGAAGCAGTTGTTCCTCAAAGTAAAATAAGTCGGTTAGTGATTGATCGATTGG 

AGCCATTATGTTGACTTGACTATTTAAAATGGTCAGCAGATCAATCAAACAAAATGTATTTATTGAAACAAGTCTTG 

TTATACTACGTGTTGATTTAAACATGTAATTTCAAGAGGATAGCTACTTTGATGTGTAATAAAATTGTCTCAAAATT 

GGTGACAAGTGCGATTGTTGTTGTGATTTATATGGAATTATGTCAATCATACTGGAAA/^TAATATGTAACCAGTTG 

AATTAAGTCATCGCCGACTCAAAATTAAATACGAATGAGGCTTTTATGTATAAAGTTTGTTATTTTATCCTGAGGAC 

TTCTCTAGGGGTGGAAAACGGATTGAGAAAGTACACCATCACGATTGCCGAACATGTACAATGCtTATCTTGAGAAA 

GAAAATTATATTTCATTCACCAAATATGAGGTGT^CCTTGCAACCACATGTATATTAAAAAGCTATGTGTCAGCTAA 

CTAATTTGTGGACTTATCATAGGTTAAATACCTCCAATGTGTACGAATGAGGAACTTGAGTAGAATATGTGAAGTTG 

CATGGAAAACTGTGAACATATCAAATTATCAAGACATCACTACAGATGTACATCATCCGAAGTTCATGTATTATATT 

GAAATTGTGTGTTCCTTATGTTGTTGGATGTACTTATTGAAGTGATCCTTCATCTATGAGGTAAGTATTAATTAATT 

TGTCCATCGTTTGATCAATCATGTGTATTTAATTAGTTTGTTGGATGTACTAAGTTTTAATTAGTTTGTTGGATGTA 

CTGAGTATTAATTAGTTTGTCCATCCCAAGCTTCATCTATAACCCAATGACAAAGGTGACAACGCTATGCACACATA 

CATACACATGATTGATCAAACGATGGACAAACTACCAATGACAAAGGTGACAACGCTATGCACACATACATACACAA 

ATAGATGAAGCTTGGAATACGGCACGGGTGGCAGCCTCGAACCTCAAGCTGGTGCGAACCCAATGAGCTAGACAAGA 

TAACAATGTCTGTCCAAGATAAGGGGAAGCTATTGTTGACTGCCATGATCTAATAGGTTGCCAAATACTAATTGTCA 

TGAGATTTATTTAGTCCAATGTGTTGGGTTTAGTCCCACTTCAGTTGTGGGGGGAGACATAACATGATTTATAAGGG 

CAGGGGTGTCCCCTCCTAACAGGCTAGTCCTTTGGAGGGAGAGGGCCCAAGACCTCTCATAAGTCGGTGtTACTCTC 

TGGTTGAGCCTGGTTGACGCATGTGGGTCGGAAATGCTAGTGTTAGCGGACCCGAAATTGCGTAACAAGTGGTACCA 

TGAGCTAGGTTGTTCGAGGTTGCGATTGTTAATTCAGAGGAGGTCGTGTTCAATCGACGGAGGCGCTGCGAGGGTTT 

GCCAGGTGCCTGCGGTGAGATCGAAGTTGTCCGATTGAAGGTCAACGGAAAGATCAAAAGCAGTCAAAGCTGGATCG 

GTTTGGCAGAGGCAGCAGGCCGAGCAAGCTGCGGGGACGCGCAGGCCAAGCAGGACCCAAGTGCGGGTGACTGGCCT 

GGTAAAGCACGCGGCTGCTAGTAGACAGGAGCGCGTATGCGGCAGCCAGACAAGCGGATGCGGTCTGCGATGTGGCA 

CTTGGCAGAGGCTGTTTGGTACGTGGTTGGCTCAAGGAATGGCCTATTGATGCTGCTGGTGTGAGCACAAAGAGGCG 

AGATTTCACGGGCAAGTTTGGCTGATTCTTGGAGAAGCGGTGTTGCATATAAAAGTCTGTTTGTTGATGTGCTTAAA 

GACTGTACC7UVGAAGCAAGTTGACGGGTCAAAAAAAAGAAGCAAGTTGAGAACATTACCAAGGGTAG 

AGGACGACAACGTGCGATAAGGCTTGAAGCAAGAGTCTGGAGCTTGTCGTGGCAGGATCATGCATACACAGGGCGTC 

AAGCAATGCACGGAGATGTGCGGGGCGGACACGGCGCAGAGTGGAGCATGGTTGCAGTCGAACATACACGGCTGGCT 

TGGACTGGATTGTTTAATGGACTGGCATGGAGGTTGGTCAAAGC (SEQ ID NO: 74) 

Also encompassed within the definition of ZCCTl DNA sequences is the ZCCTl 
protein coding cDNA sequence from T. monococcum DV92 encoded by the following 
sequence (the coding sequence is in bold with the stop codon underlined): 

TAACTGCCTCTTCTTCTTCCTCGACGTCTCTCCTCCTCGGCTCCTCCACGCACCAGACCACACCAGAAAAAACAAAC 
AAGCAAGCAAACCTTGGAGCTAGCTAGCAGTATGTCCATGTCATGCGGTTTGTGCGGCGCCAACAACTGCCCGCGCC 
TCATGGTCTCGCCCATTCACCATCATCATCACCATCATCAGGAGCACCAGCTGTGTGAGTACCAGTTCTTCGCCCAT 
GGCAACCACCACCACCACCACCATGGCTCGGCAGCAGACTACCCAGTGCCACCGCCGCCAGACAACTTCGACCACCG 
CAGAACATGGACCAGACCATTTCATGAAACAGCA6CGGCAGG6AACA6CAGCAGGCTCACGCT6GAGGTGGGCGCAG 
GCGGCCAACACAT6GCTCACCTAGTGCAGCCACCGGCAAGAGCCCACATCGTGCCATTTCACGGAGGTGCATTCACC 
AACACTATTAGCAATGAAGCAATCATGACTATTGACACAGAGATGATGGTGGGGCCTGCCCATTATCCCACAATGCA 
GGAGAGAGCAGCGAAGGTGATGAGGTATAGGGAGAAGAGGAAGAGGCGGCGCTATGACAAGCAAATCCGATACGAGT 
CCAGAAAAGCTTACGCTGAGCTTCGGCCATG6GTCAAC6GCCGCTTTGTCAAGGTACCCGAAGCCATGGCATCGCCA 
TCATCTCCAGCTTCGCCCTATGATCCTAGTAAACTTCACCTCGGATGGTTCCGGTAATTTATAGCACAAGCCAGATA 
AAATGATAACATATTTCCTTCTGATTGATCCACCCGTGAAGCAGTTGTTCCTCAAAGTAAAATAAGTCGGTTAGTGA 
TTGATCGATTGGAGCCATTATGTTGACTTGACTATTTAAAATGGTCAGCAGATCAATCAAACAAAATGTATTTATTG 
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AAACAAGTCTTGTTATACTACGTGTTGATTTAAACATGTAATTTCAAGAGGATAGCTACTTTGATGTGTAAT 
(SEQ ID NO: 75) 



The sequence of the T. monococcum DV92 ZCCTl protein is as follows (Non- 
functional with R to W mutation in bold). 

MSMSCGLCGANNCPRLMVSPIHHHHHHHQEHQLCEYQFFAHGNHHHHHHGSAADYPVPPPPDNFDHRRTWTRPFHET 
AAAGNSSRLTLEVGAGGQHMAHLVQPPARAHIVPFHGGAFTOTISNEAIMTIDTEMMVGPAHYPTMQERAAKVMRYR 
EKRKRRRYDKQIRYESRKAYAELRPWVNGRFVKVPEAMASPSSPASPYDPSKLHLGWFR* (SEQ ID NO: 
76) 



The protein sequence of the T. monococcum G3116 functional winter allele of 
ZCCTl is: 

MSMSCGLCGANNCPRLMVSPIHHHHHHHQEHQLCEYQFFAHGNHHHHHHGSAADYPVPPPPDNFDHRRTWTRPFHETA 
AAGNSSRLTLEVGAGGQHMAHLVQPPARAHIVPFYGGAFTNTISNEAIMTIDTEMIWGPAHYPTMQERA^ 
RKRRRYDKQIRYESRKAYAELRPRVNGRFVK (SEQ ID NO: 77) 



Also encompassed within the definition of ZCCTl DNA sequences are the 
genomic ZCCTl DNA sequence from Langdon (tetraplold wheat) ZCCT-Al encoded by 
the following sequence (Exon 1 and 2 are in bold with the stop codon underlined): 

GAGTTTAGTTTCCATGCCCATGATAATAGCATGGATGCCCCATGACGAAAATTGTTTCACAGCTGGTAGTACTTTTC 
TATTTTAGTATTGGCATGGTTTCCATTTTGTTGTTTTTGTCTCCCTCGGACTTTTGTGTTAGCATCTCCTTTTTGTT 
TTGACGCTGACCAAAAAAAGCTACACAAATATCTAGCAGTGGCCTTGTGTGGACATAAGATCATGTGGGGGATTCCC 
AGCAAGCAAGGTCTGCATGGCTCCGGCTCCTCCGCGTAAGAAAGAAAGAAATCAACGATGGATCGAGGGATCATATC 
TATTCCGACCCACTCATTAGTTGAGCAATATTTTGATAGTTGCCATATCGAATATTTTTTCTGGCCTGAGAGCTCAC 
GGCTGCCTATATGCAGTGCATGTGAGAGAGACACAGTACGGCCCTAGCTACTACTACAAGTACCTTGGTAGTTACTG 
GTACTCATAACTGCCTCTTCTTCTTCCTCGACATCTCTCCTCCTCGGCTTCTCCACGCACCAGACCACAGCAGAAAA 
AACAAAAAAGCAAGCAAACCTTGGAGCTAGCTAGCAGTATGTCCATGTCATGCGGTTT6TGCGGCGCCAACAACTGC 
CCGCGCCTCATGGTCTCGCCCATTCATCATCGTCATCACCATCATCAGGAGCACCAGCTGCGTCAGCACCAGTTCTT 
CGCCCAAGGCAACCACCACCACCACCACCCAGTGCCACTGCCGCCAGCCAACTTCGACCATAGCAGAACATGGACCA 
CACCATTTCATGAAACAGCAGCTGCAGGGAACAGCAGCAGGCTCACGCTGGAGGTGGGCGCAGGCGGCCGACCCATG 
GCTCACCTAGT6CAGCCACCG6CAAGAGCCCACATCGTAAGTAGTAGTACCGCTTAATTGTTTCATCTCTTGCCGAT 
GGATGCGTCCCTGGCTTCCTCCTTAAAAATCCCCACCTAATTTATGTCCATCTATACCCACTACAAAAAAATAGCAC 
CATGTAACCATCTCATATATCTGTCACATAATTCTGTTAATGTACGCTGCTCAATTGTTCTCCTGAAAAAGATATGC 
GGGAATGGATCTTGATATTCTTTAATTTTCTATGGAGGCATATATAGAGTTTGTGTTTTGTATTAGTTGATGCAGAA 
TTGTATGGGTTGTCAAATCATCAGTCATACATATAAACTTATTTCATTTTATTTGACCAACAACAAGGTAATCAGTC 
ATACATGCATACTGAAAATTTGACTTGTGTTCAATAACTAACCAACTCGACCGGCACAGCTGGGGGAAGACTTTAAT 
CAAGCTGCTAGCTAGAGCTTAATAATATAACATATCTCTTTATGGGATCAAGCAATACATATGCGCTCAATTCTCAA 
CTTGTCAATATCTATCTGGAGTCCACACTTTATGGTAATTAATTGACAAAGTTTTGTGAAATGGACAATATACATAC 
TGGATCGATGCACCCTTTTTCTCATTTTATGTGGTCATTATGAATTTGATTGTTATTTAGTATTTCAATTTTATCTT 
GAGCTAGTTTTGCAAGTCTGTAGCTCATATATAACTGATACTACTCCCCACGATAGCTTGCGTAGTGGCCGGGTGAT 
CGATCTACCGAGTTCATAAAACTGATCGAGATCGGGTCCAAAAAAGAACAAACCCATACAAAATGGAAAGAAGATCC 
TTGTTTAGTTAGTTTGCATCAGAAAATTGCCTAATTAGTTACTTGCTATCAATCTTTTGAACATGGCATGTTCACCC 
CAAACGGACTCAGATCACAATTATTGATGAAGTTACGCCTTTTAAAAACTCATAAAACTGTACATACATGTACAGGG 
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CTACACACATGTACATAATACACCTAATTAAAACGTATATTCGTAGACCAATTGTTTTGGACGGTGCACATCTTTGA 
AAAAAAATGCCAGAGGAGTTGTTAGCTTCCACTGTCCAGAAATAGAATAGTTACAATCAAGTGCATCTCTGAATGAA 
AATGGATCATTTTCTAGTTAATTAGAGACCAATTAGATACTTCATAAACAGGGGAGTATCAAGTACGTATCTGCTAC 
CCTAAGAAAGTACATAACTGCGATCTTATGATTATTTTCCTCTTGATGTTCAGGTGCCATTTTACGGAGGTGCATTC 
ACCAACACTATTAGCAATGAAGCAATCATGACTATTGACACAGAGATGATGGTGGGGCCTGCCCATTATCCCACAAT 
GCAGGAGAGAGCAGCGAAGGTGATGAGGTATAGGGAGAAGAGGAAGAGGCGGCGCTATGACAAGCAAATCAGATACG 
AGTCCAGAAAAGCTTACGCTGAGCTTCGGCCACGGGTCAACGGCTGCTTTGTCAAGGTACCCGAAGCCATGGCGTCG 
CCATCATCTCCAGCTTCGCCCTATGATCCTAGTAAACTTCACCTCGGATGGTTCCG GTAA TTTATAGCACAAGCCAG 
ATAAAATGATAACATATTTCCTTCTGATTGATCCACCCGTGAAGCAGTTGTTCCTCAAAGTAAAATAAGTCGGTTAG 
TGATTGATCGATTGGAGCCATTATGTTGACTTGACTATTTAAAATGGTCAGCAGATCAATCAAACAAAATGTATTTA 
TTGAAACAAGTCTTGTTATACTACGTGTTGATTTAAACATGTAATTTCAAGAGGATAGCTACTTTGATGTGTAATAA 
AATTGTCTCAAAATTGGTGACAAGTGCGATTGTTGTTGTGATTTATATGGAATTATGTCAATCATATTGGAAAAATA 
ATTAACCAGTTGAATTAAGTCATCGCCAACTCAAAATTAAATACGAATGAGGCTTTTATGTATAAAGTTTGTTATTT 
TATCTTGAGGACTTCTCTAGGGGTGGAAAACGGATTGAGAAAGTACACCATCACGATTGCCGAA (SEQ ID NO: 
78) 

Also encompassed within the definition of ZCCTl DNA sequences include the 
ZCCri cDNA sequence from Langdon (tetraploid wheat) ZCCT-Al encoded by the 
following sequence (the protein coding region is in bold with the stop codon 
underlined): 

TAACTGCCTCTTCTTCTTCCTCGACATCTCTCCTCCTCGGCTTCTCCACGCACCAGACCACAGCAGAAAAAACAAAA 
AAGCAAGCAAACCTTGGAGCTAGCTAGCAGTATGTCCATGTCATGCGGTTTGTGCGGCGCCAACAACTGCCCGCGCC 
TCATGGTCTCGCCCATTCATCATCGTCATCACCATCATCAGGAGCACCAGCTGCGTCAGCACCAGTTCTTCGCCCAA 
GGCAACCACCACCACCACCACCCAGTGCCACTGCCGCCA6CCAACTTC6ACCATAGCAGAACATGGACCACACCATT 
TCATGAAACAGCAGCTGCA6GGAACAGCAGCAGGCTCAC6CTGGAGGTG6GCGCA66CGGCCGACCCATGGCTCACC 
TAGTGCAGCCACCGGCAA6AGCCCACATCGTGCCATTTTACGGA6GTGCATTCACCAACACTATTAGCAATGAAGCA 
ATCATGACTATTGACACAGAGATGATGGTGGGGCCTGCCCATTATCCCACAATGCAG6AGAGAGCAGCGAAGGTGAT 
GAGGTATAGGGAGAAGAGGAAGAGGCGGCGCTATGACAAGCAAATCAGATACGAGTCCAGAAAA6CTTACGCTGAGC 
TTCGGCCACGGGTCAACGGCTGCTTTGTCAAGGTACCCGAAGCCATGGCGTCGCCATCATCTCCAGCTTCGCCCtAT 
GATCCTAGTAAACTTCACCTCGGATGGTTCCG GTAA TTTATAGCACAAGCCAGATAAAATGATAACATATTTCCTTC 
TGATTGATCCACCCGTGAAGCAGTTGTTCCTCAAAGTAAAATAAGTCGGTTAGTGATTGATCGATTGGAGCCATTAT 
GTTGACTTGACTATTTAAAATGGTCAGCAGATCAATCAAACAAAATGTATTTATTGAAACAAGTCTTGTTATACTAC 
GTGTTGATTTAAACATGTAATTTCAAGAGGATAGCTACTTTGATGTGTAAT (SEQ ID NO: 79) 

The sequence of the Langdon ZCCT-Al protein (with normal R amino acid in 
bold) is as follows: 

MSMSCGLCGANNCPRLMVSPIHHRHHHHQEHQLRQHQFFAQGNHHHHHPVPLPPANFDHSRTWTTPFHETAAA^^ 
RLTLEVGAGGRPMAHLVQPPARAHIVPFYGGAFTNTISNEAIMTIDTEMMVGPAHYPTMQERAAKVM 
YDKQIRYESRKAYAELRPRVNGCFVKVPEAMASPSSPASPYDPSKLHLGWFR* (SEQ ID NO: 80) 
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ZCCT-related protein or ZCCT-related polypeptide: A ZCCT-related protein 
or ZCCT-related polypeptide is a protein encoded by the W?yV2gene or genes of related 
function in temperate grasses. ZCCT-related proteins are defined by their structural 
homology to ZCCTl proteins and their conserved function. As discussed above, the 
ZCCT-related protein activity depends upon the intended use of the ZCCT-related 
protein. For example, ZCCT-related protein may be full activity in its ability to repress 
expression of API In temperate grasses. ZCCT-related protein activity may also be the 
ability of the ZCCT-related protein to interfere with the endogenous ZCCTl activity, 
which could include ZCCTl DNA binding activity without repression so that the ZCCT- 
related protein competes with the endogenous ZCCTl or ZCCT-related protein for its 
DNA binding site. Also, ZCCT-related protein activity could be the repressor activity 
without the DNA binding activity. Overexpression of the repressor will interfere with the 
endogenous ZCCTl or ZCCT-related protein activity by competing with accessory 
proteins that bind to ZCCTl or ZCCT-related protein and enable repression of API. 

The ZCCT-related proteins Include the wheat ZCCT2 protein. The nucleotide 
sequence for the T. monococcum ZCCTl genomic DNA Including the promoter region 
(2,588 bp upstream from start codon and 1415-bp downstream from stop codon. Exon 
1 and 2 are in bold) is as follows: 

TTTTCACCAAAAGTAATTACCTATGTACTATATGTATACTTTTTTCTGTGTGATTTTCATGACCGATCTTGTCGATAG 
CTAGCTAGGATGAGAGAAGTCAATGCATAATGTGTGTGATATGTAGCTAGCTAGCCAGTCCTAAAAGATCAAACATGT 
AATGGATTGTGCTGTGTGTGTGGCAGTCTGCATCACGCCATCAGATGACAAAAATATTCTCTACCTAGCTAGTTCACA 
TGAACTTGCTGGCCTGCATCTATACATATACACACCAGAACAAAGAATACTTGGCTTTTCTTTCTTTTTGTAGCATAG 
CCTTTGTTTCTACTTCAGGGCAGAAGAAGATGAATCGCCTGTGTGGTGGTCCGAGATAGCTTAGATGCATAGATCTGT 
GTGGTAGCCCGCAGGGTGTGGTGCTGATGAGGTCGCCCACGCGGGCTACTCCTGCGAGCTC7VGGATTGTGTGGACTGT 
CTGGCCCTGCTTTATGCAAAGGAGGAAATGGCGGCGGATTCCAACGCTCCATGGCTCCAGACTCTCCCGCTGCTTCTT 
GCGCTTTTCGCTGCCCAGAAAGATCGACCACATGTGTATCTGGATTGAAATGTGCTAGCTAATGGTTGTTTCGTTCTT 
GATTTTTGCTCCTTTTCATGTGCTCCACATGTACACAATAGAAAGATCTTGAAAACGACTCCGCTGCCGGTGATACCT 
CGATTTGTG6ATCAGAGTTGGTCGa^a^GAC»AGGGATCGATCGAGATATATGCTTGCCy^TGGCaVTCCAAATA(:a.T 
AAATGGTTCCATGTCACCGATGTGCGTGCAAGATAGOW^a^TGATCAAATTGACTGTTTAATCTCTTCGTCrGGT^ 
AATTATGAAACTAGTATATTTTCGAACCCTCAACTATATATTTTGTGTGGTGAACTTTTGTTGGTTCTCy^TTTTACAG 
CTAAATTAATTTCCACGCCCATACGTAACGCTTTGTTACGAGGGGAGGCTCACACAATTGAAAAAGCCTA9CCGCCTT 
TCCCATCGCCTTCCTTTCTCGCCGCCACGCAGGTTCAGGAGCAGCTTACTGGGGAAGTTTTGAAGGTCTTGGGGGGTT 
GTTGTAGCTTGACAAGGAGATGAAGGCTCGAGTGGAGGTTTCCATCAGGGAGTGTTCAACAGCAGCATTTGTGAATCC 
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GGCGGCGCCCGAACCAACCCAACATGGTGAGCTTCTACGGCGGCGCAGCTTGGCTAGGCCATCGGAGAAGATGATGTG 

GATGCATGGCCCTGGAGGGCGGCGAGTTCGATCTCAGGTCCAGACTTGGCGTACAACAGCGTGATAATTTTGTCTCAT 

GCATTCTTTATGCTGTCTTCGTGTTTTGTTAGGTATGGCTTGCCGGCTGCGCTCCCCCCTGTTGAAAGAAATGTTGTC 

CCCGTCTTGCCCCCTGCTGATGTGCTCACCACCGATGGAGGGTGTGTGTTTTTGTGTCTCCGTCGATGGGTCTTCCGG 

GATCCAGACGGTTTAGGTTTTCCATGGATTCGCCCGATTCGGCCAGCTTTCGTGATCTTCAGAGTTTCTACAAGTCCT 

TATCGATGTTCTCTTCTCTGGGGTGGCGGTTTGCTTTGCGGATCACAGTCTCGCCGACGTCTCTTGGTCTGCGTCAAC 

GAGTTCCTACTCGTTGCCTCTGCAAGCTCCTGGGTTTGAAAAAAGGTTCGCTACATCAAGGCCGAGACCCAAAAACAG 

CACCGAGCTTTCATAGTGCGCCGCCGATGTATGCATGACGAAGAAGACTTCGGCACCCTTGAAGTGTTGATTGTAATT 

TTTCTTTGTATATGGCTGTGTTTGTAAGGTCCTATGATTCTTAATATATGGTTTTAAGCCTCTTTGCCAAT^AAAAAGT 

TCAACGCCCATCACATTACTATTTTTATGATGAAGAGTGTTTTTGTTAACTTAAAATGTAGCGTCAAATGGATACAGA 

GCATGATGAGCAACACGCGGCCTTTGCAATACCAACTGTCTGTCCAAACTAAAAAAAAACTGACATATTGACAATGCT 

AAAGTCATATAGGACCAACACTACACCATTAATTGTTCCACAACTTGTACTTTTCTGTTTTAGTATTGCCATGGTTTC 

CACTGTGTTGTGTTTGTCTCCCTCGGACCTTTGTGTTAGCATCTCCTTTTTGTTTGACAGTGACCAAAAAAGCTACAA 

ATATCTAGCAGTGGCCTTGCGTGGACATAAGATCATGTGGGAGATTCCCGGCAAGCAAGGCGTGCATGGGTCCAGCTC 

CTCCGCGTAAGAAAGAAAGAAATAAAAAATGAATCGAGGGGTAGTATCTATTCCGACGCACTCATTAGTTGGGCCTAT 

TTGATTTGATCCATCATCTTTTGCTAATTCTCAGATCGAATCTTTTGCCTGGTCTGCAGCTCACTGCTGCATATATGC 

AGTGCAGTGGAGGAGGGAGAGACACAACACAACCCTAGCTATTTCAAGGTGCCTTAGTAGTTAGTACTCGTCGTCGTC 

TCTTCTTCTTCCTCGACATCTCTCCCCCTCCACGCGCCAAACCACACCAGAAACAAACAAGCAAGCAATCAAACGTAG 

GGGCTAGCTGCAGTATGTCCATGTCATGC6GTTTGTGCGGCGCAAGCGACTGCCCGCACCACATGATCTCGCCCGTT 

CTTCAGCATCAGGAACAACACTGGCTGCGCGAGTACCAGTTCTTCACCCAAGGCCACCACCACCACCACCACGGCGC 

GGCGGCGGACTACCCACCGCCACCGCCACCGTCGGCCAATTGCCACCACTGCAGATCATGGACCACACCGTTTCATGA 

AACAGCAGCTGCAGGGAACAGCAGCAGACTCACGCTGGAGGTAGATGCAGGCGGCCAAAACATGGCTCACCTGCTGC 

AGCCACCGGCACGGCCAAGAACCACCATCGTGAGTAGTACTACTGCTTAATTGTTCCAGCTCTTGCCGATCGCTGGG 

GCCTCCTTGTAACAAAAGTTCCCTTTTACGTAATCTCCATCTACTCCCCCCCCCCCCCCCCCCCCCCCCGCCCCCTCC 

CCGCATCTCAAAAAAAGTTAGCGCCATGTAACCAGCTCATATATCTGTCACATAATTCTGTTAATTTATGCTGGTCAA 

TTATAATCTCCCAAGGCAGAAAGTTTGTGTTTTGTATCAGTTGATGAACAAGAATGGGAACTCACATCATCAGTTACA 

CATACATACTTATTTCATTTTATTTGACTAACAAGGTAATCAGTTAATTCCTTTATGGGAACAAGCAATACATATGTC 

CACGCCTTCATGTTAATTCCTTGACAAAGTTTGTGAAATGGACAATATATATACTGGATCAATGCACCCTCTTTCTCA 

TTTTATGTGGTCATTTATGAATTTTAGTGCTATTTTATATTTAAATTTTCTCTTAAGCTTGTTTTGTAAGCTTATAGC 

TCATGTATAACAGATACTACTCCCCATAATTGCTTCCGTAGTGGCCGGGTGATCAATCTACCGAGTTCATAAAACTGA 

TCGGGATCAGATCCAAAACAGACCAAAACCTCACGAAATAGAAACAAGATCCTTGTTTAATAAGTTTGCACCAGGAAA 

TTGCCTACTTAATTACTTTCTATCAATCTTATGAACATGGCATGTTTCTCACATATGGTGACCCAGATCACAATTGTT 

GACGGAGTTAAACATTTTTAGCAATTCATAAAACCATGCACAGATGTACAGGGCTACGCGTATGCACATACATAATAC 

ACCTAATTAAAACATATATTCATAGAGCGATTGAGTTTGGACTGTGCGCTTCTTTGGACACAAAGGCCCGGGAAGTTG 

TTCTCTTCCATTGTCTAGAAAAATAGAACAGTTACAATCAAGTGCACCACTGAATGAAAATGGGTC/y^CTCTGGTTAA 

TAAGAGACCAACTGTACTTCATAAACAGGGAATATCATGTACATATCTGCAACCCACAGGAAAAGTACAGAGCTGCAC 

TCTTACAGTTATTTTCCTCTTCATGTTCAGGTGCCATTCTGCGGGGCTGCATTCACCAGCACTATTAGCAATGCAACA 

ATCATGACTATTGATACAGAGATGATGGTGGGGGCTGCCCATAATCT6ACGAT6CAGGAGA6A6A6GCGAAG6TGATG 
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AGGTACAGGGAGAAGAGGAAGAGGCGGTGCTATGACAAGCAAATCCGCTACGAGTCCAGAAAAGCTTACGCCGAGCTC 
AGGCCACGGGTCAATGGCTGCTTTGTCAAGGTACCAGAAGCCGCTGCATCGTCGTCACCCCCAGCTTCGCCCTATGAT 
CCTAGTAAACTTCACCTCGGATGGTTCCAG TAG TTTTTCATCAAAGTAAAATAAGTTGGTTATTGATTGACCGACGGG 
AGGAGTTATGTTGATTTGACTATTTCAAAAGGTCAGCAAACCAATCAAAGAAAATGTATTTGTTGAAACAAGTATTGT 
TATGCTTTATGTTAATTTAAGCATGTAATTTGAGGAGGCTAGCTACTTAGATGTGTTTATTGTATGCAACCAATTGAA 
TCAAGTCATCACCAACTCAAAGTTAAATACGGACGAGCTTTTTATGTATAAAGTTGTTGTTTTATCTTGACGACTTAT 
CAAGAGTGAAAAATGGATTGGGAGTTATGAGTAAACCATCACGACCGCCAAAGATGTACAATGCTTATTTTGAGAGAG 
AAAATTATATTTCACTCACCAAATATGAGTTGAACCTTGTAACCACATGTATATTACAAAGCTGTGTGTCACCTAACT 
AATTTGAGGCCTTATCATAGGTAAAATACCTCCAATCTGCACGAATGAGTCACTTTAAAAGAATATGTCACGTTGCAT 
GGAAAACTGTAAACATGTGTAGACAGCATAATATATAGCTGCAAATCATCCAAAGCTTGTGTACTATATTATAATTAT 
GAGTTCCTTATGCTGTTGCATGTACTTATTGAAGTGATCCTTCGTCTATGAGGTAAGTTTGTACATTCATTCATCCAT 
TTAACCTCGCGAATATAGATAGCTAGTTAATTCGTTTGATCGATCATGTGTATGTGTGCGTCCATAGTGTCACCGCCT 
TTATCACTGGGAGATGGCACACGGTGGCAGCCTCCAACCTCAAGCTGGTGCAAACCAGATCAGCTAGACAAGATAACA 
ATGTTTGTCCAAGATAAAGGGAAGCTATTGTTGACTACCATGATCCAGCAAGTTGCCAAATACTAGGGGCCAGTTCTT 
TTGTTGGCTTCCAAAATAAGCTGCCCCCTATCTAGCTTTTTCTAAAAGCCCAACCAAATTAATTTTTTAGAAGCTTAC 
TAATTAAGACTTGATAGTAGGCTTCTAAAAAGTATTTTTGGTTGGGCTTCTAGAATAAGCTGGGTAGGGGGCCGCTTA 
TTCTAGAAGCCCAAAAAACCACTAAAAGAACTGACCCTAATTGTCATGAGATTTATTAAGTCCAAAGCTCGATGGAAG 
TGACTAGATTAATTGTTTGTTCCTAAATTCATGGGCGGATGCCATGGGTGAAGGCAAAGTAAGTTTAACTATATACTT 
AACACTAGTTATCTAATAAGTTAATGCTACTAGCTATTTGTTGATATCATGATAATATTTAGACTGAATTATATTATG 
GAGTGTAAAATTTCACAATATTTCAGCAGCGGCACCCCGGATATTAAGATCCTAGGTCCTTCACCGCCTTAATTAATA 
TCAATTCCCCTGAACAAGTTATTACTTGGTTGTTCCATCTTGTA (SEQ ID NO: 81) 

The predicted cDNA sequence for T. monococcum ZCCr2 is as follows 

ATGTCCATGTCATGCGGTTTGTGCGGCGCAAGCGACTGCCCGCACCACATGATCTCGCCCGTTCTTCAGCATCAGGA 
ACAACACTGGCTGCGCGAGTACCAGTTCTTCACCCAAGGCCACCACCACCACCACCACGGCGCGGCGGCGGACTACC 
CACCGCCACCGCCACCGTCGGCCAATTGCCACCACTGCAGATCATGGACCACACCGTTTCATGAAACAGCAGCTGCA 
GGGAACAGCAGCAGACTCACGCTGGAGGTAGATGCAGGCGGCCAAAACATGGCTCACCTGCTGCAGCCACCGGCACG 
GCCAAGAACCACCATCGTGCCATTCTGCGGGGCTGCATTCACCAGCACTATTAGCAATGCAACAATCATGACTATTG 
ATACAGAGATGATGGTGGGGGCTGCCCATAATCTGACGATGCAGGAGAGAGAGGCGAAGGTGATGAGGTACAGGGAG 
AAGAGGAAGAGGCGGTGCTATGACAAGCAAATCCGCTACGAGTCCAGAAAAGCTTACGCCGAGCTCAGGCCACGGGT 
CAATGGCTGCTTTGTCAAGGTACCAGAAGCCGCTGCATCGTCGTCACCCCCAGCTTCGCCCTATGATCCTAGTAAAC 
TTCACCTCGGATGGTTCCA GTAG (SEQ ID NO: 82) 



The protein sequence for T. monococcum ZCCTl from DV92 is as follows: 

MSMSCGLCGASDCPHHMISPVLQHQEQHWLREYQFFTQGHHHHHHGAAADYPPPPPPSANCHHCRSWTTPFHETAAAG 
NSSRLTLEVDAGGQNMAHLLQPPARPRTTIVPFCGAAFTSTISNATIMTIDTEMIWGAAHNLTMQEREAK^ 
KRRCYDKQIRYESRKAYAELRPRVNGCFVKVPEAAASSSPPASPYDPSKLHLGWFQ* (SEQ ID NO: 83) 
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The genomic DNA sequence for the Langdon ZCCT2 gene is as follows: 

TGTTTTGTTAGGTATGGCGTGCCGGCTGTGCTCCCCCCTGTTGAAAGAAATGTTGTCCCCATCCTGCCCCCTGGCGA 
TGTGCTCACCACCGATGGAGGGTGTGTGTTTTTGTGTCTCCGTCGATGGGTCTTCTGGGATCCGGTCGGTTTAGGTT 
TCCCATGGATTCGCCCGAATTCGGCCAGCTTTCGTGATCTTCAGAGTTTCTACAAGTCCTTATCGACGTTCTCTTCT 
CTGGGGTGGCGGTTTGCTTTGCGGATCACAGTCTCGCCGACGTCTCTTGGTCTGCGTCGACGAGTTCCTACTCGTTG 
CCTCTGCAAGCTCCTGGGTTTCAAAAAAGGTTTGCTACATCAAGGCGQAGACCCAAAGACAGCACCGAGCTTTCATA 
GTGCGCCGCCGATGTATGCATGACGAAGAAGACTTCGGCACCCCTAAAGTTTTGATTGTAATTTTTCTTTGTATATG 
GGTGTATTTGTAAGGTCCTATGATTCTTAATATATGGTTTTAAGCCTCTTTGCCAAAAAAAAAAGTTCAACACCCAT 
CACATTACTATTTTTACGATGAAGAGTGTTTTTGTTAACTTAAAATGTAGCGTCAAATGGATACAGAGCATGATGAG 
CAACACGCGGCCTTTGCAATACCAAGTGTCTGTCCAAACTAAAAAAAACTGACATATTGACAATGCTAAAGTCATAT 
AGGACCAACACTACACCACTAAATGTTCCACAACTTGTACTTTTCTGTTTAGTATTGCCATGGTTTCCATTGTGTTG 
TGTTTGTCTCCCTCGGACCTTTGTGTTAGCATCTCCTTTTTGTTTGACAGTGACCAAAAAAGCTACAAATATCTAGC 
AGTGGCCTTGCGTGGACATAAGATCATGTGGGAGATTCCCGGCAAGCAAGGTGTGCATGGCTCCAGCTCCTCCGCGT 
AAGAAAGAAAGAAATAAAAAATGAATCGAGGGGTAGTATCTATTCCGACGCACTCATTAGTTGGGCCTATTTGATTT 
GATCCATCATCTTTTGCTAATTCTCAGATCGAATCTTTTGCCTGGTCTGCAGCTCACTGCTGCATACATGCAGTGCA 
GTGGAGGAGGGAGAGACACAACACAACCCTAGCTATTTCAAGGTGCCTTAGTAGTTAGTACTCGTCGTTGTCTCTTC 
TTCTTCCTCGACATCTCTCCCCCTCCACGCACCAAACCACACCAGAAACAAACAAGCAAGCAAGCAAACGTAGGAGC 
TAGCTGCAGTATGCCCATGTCATGCGGTTTGTGCGGCGCAAGCGACTGCCCGCACCACATGATCTCGCCCGTTCTTC 
AGCATCAGGAACAACACCGGCT6CGCGAGTACCAGTTCTTCACCCAAGGCCACCACCACCACCACCACGACGCGGCG 
GCGGACTACCCACC6CCACCGCCACCGTCAGCCAATTGCCACCACTGCAGATCATGGACCACACCGTTTCATGAAAC 
AGCAGCTGCA6GGAACAGCAGCAGGCTCACGCTGGAGGTAGACGCAGGC6GCCAAAACATGGCTCACCTGCTGCAGC 
CACCG6CAC6GCCAAGAACCACCATCGTGAGTAGTACTACTGCTTAATTGTTCCAGCTCTTGCCGATCGCTTGGGCC 
TCCTTCTAACAAAAGTTCCCTTTTACGTAATCTCCATCTACTCCCCCCCCCCCCCCCCCCCGGCATCTCAAAAAAAG 
TTAGCGCCATGTAACCAGCTCATATATCTGTCACGTAATTCTGTTAATTTATGCTGGTTGAATATAATCTCCCAAGG 
CAGAGTGTTTGTGTTTTGTATCAGTTGATGCACAAGAATGGGCACTCACATCATCAGTTACACATACATACTTATTT 
CATTTTATTTGACTAACAAGGTAATCAGTTAATTCCTTTATGGGAACAAGCAATACATATGTCCACGCCTTCATGTT 
AATTCCTTGACAAAGTTTGTGAAATGGACAATATATATACTGGATCAGTGCACCATCTTTTTCATTTTATGTGGTCA 
TTTATGAATTTTAGTGCTATTTTGTATTTAAAATTTTCTCTTAAGCTTGTTTTGTAAGCTTATAGCTCAAGTATAAC 
AGATACTACTCCCCATAATTGCTTCCGTAGTGGCCGGGTGATCAATCTACCGAGTTCATAAAACTGATCGAAATCAG 
ATCCAAAACAGACCAAAACCTCACGAAATAGAAACAAGATCCTTGTTTAATTAGTTTGCACCAGGAAATTGCCTACT 
TAATTACTTTCTATCAATCTTATGAAGATGGTATGTTTCTCACATATGGTGATCCAGATCACAATTGTTGACGGAGT 
TAAACATTTTTAGCAATTCATAAAACCGTGCACAGATGTACAGGGCTACGCGTATGCACATACATAATACACCTAAT 
TAAAACATATATTCATAGAGCGATTGAGTTTGGACTGTGCGCTTCTTTGGACACAAAGGCCCGGGAAGTTGTTCTCT 
TCCATTGTCTAGT^AAAATAGAACAGTTACAATCAAGTGCACCACTGAATGAAAATGGGTCAATTCTGGTTAATAAGA 
GACCAACTGTACTTCATAAACAGGGAATATCATGTACATATCTGCAACCCACAGGAAAAGTACAGAACTGCACTCTT 
ACGATTATTTTCCTCTTCATGTTCAGGTGCCATTCTGCGGGGCTGCATTCACCAGCACTATTAGCAATGCAACGATC 
ATGACTATTGATACAGAGATGATGGTGGGG6CT6CCCATAATCTGACGATGCAG6AGAGA6AGGCGAAG6T6ATGAG 
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GTACAGGGAGAAGAGGAAGAGGCGGTGCTATGACAAGCAAATCCGCTATGAGTCCAGAAAAGCTTACGCCGAGCTCA 

GGCCACGGGTCAATGGCCGCTTTGTCAAGGTACCAGAAGCCGCTGCATCGTCGTCACCCCCAGCTTCGCCCTATGAT 

CCTAGTAAACTTCACCTCGGATGGTTCCGGTAGTTTTTCATCAAAGTAAAATAAGTTGGTTATTGTTTGACCGATGG 

GAGGAGTTATGTTGATTTGACTATTTCAAAAGGTCAGCAGACCAATCAAAGAAAATGTATTTGTTGAAACAAGTATT 

GTTATGCTTTATGTTAATTTAAGCATGTAGTTTGAGGAGGCTAGCTACTTAGAT.GTGTTTATTGTATGCAACCAATT 

GAATCAAGTCATCACCAACTCAAAGTTAAATACGGACAAGCTTTTTATGTATAAAGTTGTTATTTTATCTTGACGAC 

TTATCAAGAGTGAAATU^TGGATTGGGAGTTATGAGTAAACCATCACGACCGCCAAAGATGTACAATGCTTATTTTGA 

GAGAGAAAAATTATATTTCACTCACCAAATATGAGTTGAACCTTGTAACCACATGTATATTACAAAGCTGTGTGTCA 

CCTAACTAATTTGAGGCCTTATCATAGGTAAAATACCTCCAATCTGCACGAATGAGGCACTTTAAA (SEQ ID 

NO: 84) 

The predicted cDNA sequence for the Langdon ZCCT2 gene is as follows: 

ATGCCCATGTCATGCGGTTTGTGCGGCGCAAGCGACTGCCCGCACCACATGATCTCGCCCGTTCTTCAGCATCAGGA 
ACAACACCGGCTGCGCGAGTACCAGTTCTTCACCCAAGGCCACCACCACCACCACCACGACGCGGCGGCGGACTACC 
CACCGCCACCGCCACCGTCAGCCAATTGCCACCACTGCAGATCATGGACCACACCGTTTCATGAAACAGCAGCTGCA 
GGGAACAGCAGCAGGCTCACGCTGGAGGTAGACGCAGGCGGCCAAAACATGGCTCACCTGCTGCAGCCACCGGCACG 
GCCAAGAACCACCATCGTGCCATTCTGCGGGGCTGCATTCACCAGCACTATTAGCAATGCAACGATCATGACTATTG 
ATACAGAGATGATGGTGGGGGCTGCCCATAATCTGACGATGCAGGAGAGAGAGGCGAAGGTGATGAGGTACAGGGAG 
AAGAGGAAGAGGCGGTGCTATGACAAGCAAATCCGCTATGAGTCCAGAAAAGCTTACGCCGAGCTCAGGCCACGGGT 
CAATGGCCGCTTTGTCAAGGTACCAGAAGCCGCTGCATCGTCGTCACCCCCAGCTTCGCCCTATGATCCTAGTAAAC 
TTCACCTCGGATGGTTCCGGTAG (SEQ ID NO: 85) 



The protein sequence for the Langdon ZCCT2 is as follows: 

MPMSCGLCGASDCPHHMISPVLQHQEQHRLREYQFFTQGHHHHHHDAAADYPPPPPPSANCHHCRSWTTPFHETA^ 
NSSRLTLEVDAGGQNMAHLLQPPARPRTTIVPFCGAAFTSTISNATIMTIDTEMMVGAAHNLTMQERE^^^ 

KRRCYDKQIRYESRKAYAELRPRVNGRFVKVPEAAASSSPPASPYDPSKLHLGWFR* (SEQ ID NO: 86) 

> 

Also encompassed within the definition of ZCCT-related proteins are the winter 
barley ZCCT-Ha/Hb proteins. The ZCCT-Ha Dairokkaku Genomic sequence is as follows 
(exon 1 and 2 are in bold): 

CCTCCGCGTAAGGAAGAAATAAATCAAAAATGCATCGAGGGACCGTATCTATTCCGACGCGCTCATTAGTTGGGCCT 
ATTTGATTTGATTTGATCCATCGTTTTTGCTAATTCTCAGGTCGAATCTTTTGTTTGGCCTGCAGCTCAGTGCTGCA 
TATATGCAGTGCAGTGCAGTGCAGGAGGGAGAGACACAATACAGCCCTAGCTTCTTCAAGGTGTTAGTAGCTAGCAC 
TCATCGCTGTCTCTTCTTCTTCCTCGACATCTCTCTTCCACGCACCAGACCACACCAGAAACAAACAAACTAGCAAA 
CAAGCAAACGTTGGAGTTAGCTGCAGTATGTCCATGTCATGTGGTTTGTGCGGCGCCAGCAACTGCGCGTACCACAT 
GATGTCGCCCGTTCTTCTTCATCATCACCATCATCAGGAACACCCACTGCACGAGTACCAGTTCTTCGCCCAAGGTC 
ACCACCACCACCACAGC6CGGCAGCGGACTACCCACCACCACCGCCACCGCCAGACAATTGCCACCACCACAGATCA 
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TGGACCACGCCGTTTCATGAAACAGCAGCTCCAGAGAACAGCACCAGGCTCACACGGGAGGTGGACGCAGGCGGCCA 
ACACATGGCTCACCTGCTGCAGCCACCGGCGCCGCCAAGAGCCACCATCGTGAGTAGTACTACTGCTTAATTTTTCT 
ATCTCTTGCCGATCGATGGGACCTGCTAACAAAAATCACACTTTCTTAATTTCCATCTCAAAAAAAGCTACCGCCAT 
GTGACCAGCTCATATATATGCCACATAACTCCTTTAATTTATTCTGGTCGATTGTAATTTACCAAGGCAGAAAGCTT 
GTATTTTGTATCAGTTGATGCACAAGAATGGGCGCTCACGTCATCAGTCGCACATACTATATACTTATTTCATTTTA 
TTTGACTAACAAGGTAACTAGTTAATTCCTTTATGGGGTCAAGCAATACATATGTGCACGCCTTCATGTTAATTCCT 
TGACAAAGTTTGTGAAGTGGAAAATATATTTACTTTATCAATGCACCTACTCTCATTTTATGTGGTCATTTATGAAT 
TTTATTAATTTTCTGTTGAGCTAGTTTTGTATGCTTATAGCTCATATATAACTGATACTACTCCCCATAATTTTTCC 
GTAGTGGTCGGGTGATCGATCTACCTAGTTCATAAACTTATCGAGATCAGGTCCTU^CAGACCAAAACCTCTVCGAA 
ATGGAAACAAGATCCTTGTTTAATTAGTTTGCATCAGGAAATTGCTTATTACTTGCTGTCAATCTTATGAAGATGGT 
ATTTTCCTCACAAATGGATCCAGTCACAATTGTTGATGAAGTTAAACATTTTTGGCAATTCATAAAACCGTGCATAG 
ATGTCCGGCTACACGCACACAAGTACATAATACACCTAGTTAAAACATATATCCATAGAGCAATTGAGTTTGGACTA 
TGCGCTTCATTGGACACAAAGGCCCGGGAAGTTGTTCTCTTCCATTGTCTAAAAAAATAGAACAGTTACAGTCAAGT 
GCAACACTGAATGAAAATGGATCAAGTTTTGGTTAACAAGAGACCAACTTATACTTCATAAACAAGGAATATCAAGT 
ACATATCTGCTACCCACAAGAAAAGTACACCTTATGACTATTTTCTTCTTGATGTTCAGGTGCCATTCTGCGAGAGT 
GCATTCGCCAGCACTATTAGCAACGCAACGATCATGACTATTGATACAGAAATGATGGTG6GGCCTGCCTATAATCC 
AACGATGCAGGAGAGA6AGGCGAAGGTGATGA66TACAGGGAGAA6AGGAAGAGGCGGCGCTATGACAAGCAAATCC 
GCTACGAGTCCAGAAAAGCTTAC6CCGAGCTCAGGCCACG6GTCAATG6CC6CTTTGCCAAGGTGCCCGAAGCCGTT 
GTGTCTCCATCACCCCCAACTTCCCCCCATGATCCTAGTAAACTTCACCTCGGATGGTTC (SEQ ID NO: 87) 

The predicted cDNA sequence for the ZCCT-Ha Dairokkaku gene is as follows: 

ATGTCCATGTCATGTGGTTTGTGCGGCGCCAGCAACTGCGCGTACCACATGATGTCGCCCGTTCTTCTTCA 
TCATCACCATCATCAGGAACACCCACTGCACGAGTACCAGTTCTTCGCCCAAGGTCACCACCACCACCACAGCGCGG 
CAGCGGACTACCCACCACCACCGCCACCGCCAGACAATTGCCACCACCACAGATCATGGACCACGCCGTTTCATGAA 
ACAGCAGCTCCAGAGAACAGCACCAGGCTCACACGGGAGGTGGACGCAGGCGGCCAACACATGGCTCACCTGCTGCA 
GCCACCGGCGCCGCCAAGAGCCACCATCGTGCCATTCTGCGAGAGTGCATTCGCCAGCACTATTAGCAACGCAACGA 
TCATGACTATTGATACAGAAATGATGGTGGGGCCTGCCTATAATCCAACGATGCAGGAGAGAGAGGCGAAGGTGATG 
AGGTACAGGGAGAAGAGGAAGAGGCGGCGCTATGACAAGCAAATCCGCTACGAGTCCAGAAAAGCTTACGCCGAGCT 
CAGGCCACGGGTCAATGGCCGCTTTGCCAAGGTGCCCGAAGCCGTTGTGTCTCCATCACCCCCAACTTCCCCCCATG 
ATCCTAGTAAACTTCACCTCGGATGGTTC (SEQ ID NO: 88) 

The protein sequence for the ZCCT-Ha Dairokkaku is as follows: 

MSMSCGLCGASNCAYHMMSPVLLHHHHHQEHPLHEYQFFAQGHHHHHSAAADYPPPPPPPDNCHHHRSWTTPFHETA 
APENSTRLTREVDAGGQHMAHLLQPPAPPRATIVPFCESAFASTISNATIMTIDTEMMVGPAYNPTMQEREAKVMRY 
REKRKRRRYDKQIRYESRKAYAELRPRVNGRFAKVPEAWSPSPPTSPHDPSKLHLGWF-- (SEQ ID NO: 89) 
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The genomic DNA sequence of the barley ZCCTl-Hb from Dairokkaku is as 
follows (exon 1 and 2 are in bold): 

TCAAATATTCTAGCAGTGGCCTTGCGTGGACATAAGATCATGTGGTAGATTCCCGGCAAGCAAGGTGTGCA 
TGGCTCCAACTCCTCCGCGTAAGGAAGAAATAAATCAAAAATGCATCGAGGGACCGTATCTATTCCGACGCACTCAT 
TAGTTGGATTTATCTGATTTGATTTTATCCATCGTCTTTTGCTAATTCTCAGATCGAATCTTTTGTCTGGTCTGCAG 
CTCACTGCTGCATATATGCAGTGCAGTGCAGGAGGGAGAGACACAATACAGCCCTAGCTTCTTCAAGGTGCTTTAGT 
AGCTAGCACTCATCGCTGTCTCTTCTTCTTCCTCGACATCTCTCTTCCACGCACCAGACCACACCAGAAACAAACAG 
ACAAGCAAGCAAGCAAGCAAACGTTGGAGCTAGCTGCAGTATGTCCATGGCATGCGGTTTGT6CGGCGCCAGCAATT 
GCCCGTATCACATGATGTCGCCCGTTCTTCTTCATCATCACCATCATCAGGAACATCGGCAGCGCGAGTACCAGTTC 
TTCGCCCAAGGTCACCACCACCACCACCACGGCGC6GCAGCAGACTACCCACCGCCACAGCCACC6CCGGCCAATTG 
CCACCACCGCAGATCATGGGCCACGCTGTTTCATGAAACAGCAGCTCCAGTGAATAGCACCAGGCTCACACAAGAGG 
TGGACGCAGGCGGCCAACAGATGGCTCACCT6CTGCA6CCACCGGCGCCGCCAAGAGCCACCATCGTGAGTACTACT 
GCTTAATCGTTCCATCTCTTCCCGATCGATGTGACTCCTTCTAACAAAAATCACACTTTCTTAATTTCCATCTCAAA 
AAAAGCTAGCGCCATGTGACCAGCTCATATATCTGTCACATAACTCCGTTAATTTATGCTGGTCGATTGTAATTTAC 
CAAGGCAGAAAGTTTGTGTTTTGTATCAGTTGATGCACAAGACTGGATGCTCAGATCATCAGTCACACATACTATAT 
ATTTATTTCATTTTATTTGACTAACAAGGTAATCAGTTAATTCCTTTATGGGGTCAAGCAACATATGTCCACGCCTT 
CATGTTAATTCCTTGGCAGAGTTTGTGAAATAGAAGATATATATTGGGATCAATGCACCCTACCTCTTTCTCATTTT 
ATGTGGTCATTTAAGAATTTGAATGCTATTTTGTATTTAAATTTTCTCTTGAGCTAGTGTGTAAGCTTATAGCTCAT 
ATATAACTGATACTACTCCCCATATTGCTTCCATAGTGGCCGGGTGATTGATCTACCGAGTTCATGAAACTGATCAA 
GATCAGGTCCAAAACAGGCCAAAACCTCACGAAATGGAATTACGATCCTTGTTTAATTAGTTTGCATCAGGAAATTG 
GCTACTTAATTACTTGCTACCAATCTTATGAAGATGGCATGTTTCCTCACAAATGGATCCAGCTCACAATTTTTGGT 
GAAGTTAAACATTTTTTAGCAATTCATAAAAGGTGCATAGATGTACAGGGCTACACGTACACACGCACATAATACGC 
CTAGTTAAAACATATATGCATAGAGCAATTGAGTTTGGACAATGCGCTTCTTTGGACATAATGGCCCGGGAAATTGT 
TCTCTTCCATTGTCTAAAAACATAGAACAGTTAGAATCAAGTGCACCACTGAATGAGAATGGGTCAATTTTTGGTTA 
ACGAGAGACCAACTATACGTTATAAACACTGTACTACTCTCACCATTGTTTTCCTCTCGATGTTCAGGTGCCATTCC 
GCCGGA6T6CATTCACCAACACTATTAGCAAC6CAACGATCATGACTATTGATACAGA6ATGAT6GC6GGGACTGCC 
TATAGTCCAACGATGCAGGAAAGAGAAGCAAAGGTGATGAGGTACAGGGAGAAGAGGAAGAA6CGGCGCTATGACAA 
GCAAATCCGCTACGAGTCCAGAAAAGCTTAC6CCGAGCTTA6GCCAC6GGTCAACG6CCGCTTTGTCAAGGTACCTG 
AAGCCGCTGCGTCACCATCACCCCCAGCTTCGCCCCATGATCCTAGTGAACTTCACCTCGGATGGTTC ( SEQ ID 
NO: 90) 

The predicted cDNA sequence for the barley ZCCTl-Hb from Dairokkaku is as 
follows: 

ATGTCCATGGCATGCGGTTTGTGCGGCGCCAGCAATTGCCCGTATCACATGATGTCGCCCGTTCTTCTTCATCATCA 
CCATCATCAGGAACATCGGCAGCGCGAGTACCAGTTCTTCGCCCAAGGTCACCACCACCACCACCACGGCGCGGCAG 
CAGACTACCCACCGCCACAGCCACCGCCGGCCAATTGCCACCACCGCAGATCATGGGCCACGCTGTTTCATGAAACA 
GCAGCTCCAGTGAATAGCACCAGGCTCACACAAGAGGTGGACGCAGGCGGCCAACAGATGGCTCACCTGCTGCAGCC 
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ACCGGCGCCGCCAAGAGCCACCATCGTGCCATTCCGCCGGAGTGCATTCACCAACACTATTAGCAACGCAACGATCA 
TGACTATTGATACAGAGATGATGGCGGGGACTGCCTATAGTCCAACGATGCAGGAAAGAGAAGCAAAGGTGATGAGG 
TACAGGGAGAAGAGGAAGAAGCGGCGCTATGACAAGCAAATCCGCTACGAGTCCAGAAAAGCTTACGCCGAGCTTAG 
GCCACGGGTCAACGGCCGCTTTGTCAAGGTACCTGAAGCCGCTGCGTCACCATCACCCCCAGCTTCGCCCCATGATC 
CTAGTGAACTTCACCTCGGATGGTTC (SEQ ID NO: 91) 

The protein sequence for the barley ZCCT-Hb Dairokkaku (which alone is not 
sufficient alone to generate a vernalization requirement) is as follows: 

MSMACGLCGASNCPYHMMSPVLLHHHHHQEHRQREYQFFAQGHHHHHHGAAADYPPPQPPPANCHHRRSWATLFHET 
AAPWSTRLTQEVDAGGQQMAHLLQPPAPPRATIVPFRRSAFTNTISNATIMTIDTEMMAGTAYSPTMQEREAKVMR 
YREKRKKRRYDKQIRYESRKAYAELRPRVNGRFVKVPEAAASPSPPASPHDPSELHLGWF-- (SEQ ID NO: 
92) 

ZCCTl Promoter: A ZCCTl promoter is a promoter from the ZCCTl gene. 

ZCCri promoters are generally found 5' to the ZCCTl protein coding sequence and 
regulate expression of the ZCCTl gene. ZCCTl promoter sequences as defined herein 
include those sequences that hybridize under high stringency conditions to promoter 
regions contained in the nucleic acids of SEQ ID NO:74 and 78. Such sequences can be 
synthesized chemically or they can be isolated from plants. ZCCTl promoters can be 
spring or winter ZCCTl promoters, for example, spring wheat or winter wheat ZCCTl 
promoters. Representative plants from which ZCCTl promoters can be isolated include 
wheat (spring and winter). Functional ZCCTl promoters are preferred for their 
responsiveness to vernalization. However, non-functional ZCCTl promoters are 
included and may be used for example as probes for detecting spring phenotype or as 
part of a nucleotide used for homologous recombination to convert winter varieties to 
spring varieties. 

ZCCT-related Promoter: A ZCCT -related promoter is a promoter from a 
ZCCTl gene or a ZCCT-related protein gene. ZCCT -related proteins promoters are 
generally found 5' to the ZCCTl protein coding sequence or ZCCT-related protein 
coding sequence and regulate expression of the operably linked gene. ZCCT-1 related 
promoters are characterized by their down-regulation in response to vernalization. 
ZCCT-related promoter sequences as defined herein include those sequences that 
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hybridize under high stringency conditions to promoter regions contained in the nucleic 
adds of SEQ ID NO:74, 78, 81, 84, 87, and 90. Such sequences can be synthesized 
chemically or they can be isolated from plants. Representative plants from which 
ZCCTl promoters can be isolated include wheat, barley, rye, triticale, oat and forage 
grasses. 

Tal<ing into account these definitions, the present invention Is directed to the 
finding that differences in the sequence of the wheat API promoter and/or the wheat 
ZCCTl protein are the determining factors in distinguishing winter wheat from spring 
wheat. Mutations in one or both of these two regions eliminate the requirement for 
vernalization to flower. This has been demonstrated In wheat and barley and is Inferred 
to be common to all temperate grasses that have a vernalization response. Winter 
wheats require several weeks at low temperature to flower. This process called 
vernalization is controlled mainly by the VRNl gene which in turn is repressed directly 
or indirectly by the gene product of the l/7?/V2gene. As detailed In Example 1, using 
6,190 gametes VRNl was found to be completely linked to MADS-box genes API and 
AGLGl in a 0.03-cM interval flanked by genes Cysteine and Cytochrome B5. No 
additional genes were found between the last two genes in 324-kb of wheat sequence 
or in the colinear regions in rice and sorghum. Example 1 further shows that API 
transcription Is regulated by vernalization In both apices and leaves, and the progressive 
increase of API transcription was consistent with the progressive effect of vernalization 
on flowering time. In addition. Example 1 indicates that vernalization is required for API 
transcription in apices and leaves in winter wheat but not in spring wheat. No 
differences were detected between genotypes with different VRNl alleles in the API 
and AGLGl coding regions, but three independent deletions were found in the promoter 
region of API. 

In particular, all accessions with deletions that affect all, a portion or an adjacent 
region to the CArG box region (SEQ ID NO:23) in the wheat API promoter sequence 
have a spring growth habit. These results and the relatively later expression of AGLGl 
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during the flowering process demonstrate that API is a better candidate for VRNl than 
AGLGl. 

The epistatic interactions between vernalization genes VRNl and VRN2 
suggested a model in which l^/?/V2 would repress directly or indirectly the expression of 
API (Fig. 12). As explained in detail below, a mutation in the CAr-G section of the 
promoter region of API or adjacent regions would result in the lack of recognition of the 
repressor and In a dominant spring growth habit. The present Invention Is directed to 
this finding and the finding that ZCCTl acts to repress the API gene and their 
application to plant molecular biology and plant breeding. 

As detailed in Example 1, k/?yV2 was found to be completely linked to ZCCTl and 
ZCCT2, two closely related homologs. Vernalization resulted in a gradual and stable 
repression of ZCCTl transcription In leaves and apices. ZCCT2y\ias not detected in the 
apices. The identity between ZCCTl and l/7?/V2 was confirmed by the association of the 
ir/72 allele for spring growth phenotype with four Independent ZCCTl mutations, and by 
the elimination of the vernalization requirement in transgenic winter wheat by RNA 
Interference as Illustrated in Example 2. 

The API Promoter 

The Isolation and sequence analysis of the wheat API promoter and the 
determination that it is the controlling factor in distinguishing winter wheat from spring 
wheat has broad applications In plant molecular biology and plant breeding. 

As a first embodiment, the present invention Is directed to the API promoter 
Isolated from spring wheat. The winter wheat API promoter sequence, G3116, 
depicted in Figure 9 (SEQ ID NO: 12) contains a core CArG box sequence 
CCTCGTTTTGG (SEQ ID NO:23). The spring wheat API promoter sequence lacks all, a 
portion, or adjacent sequence to this core sequence. As such, the present invention Is 
directed to a recombinant API promoter sequence wherein the API promoter sequence 
hybridizes to the nucleic acid molecule of SEQ ID NO: 12 or the complement thereof 
under high stringency conditions wherein the API promoter sequence lacks all or a 
portion of nucleotides -162 to -172 upstream the start codon of SEQ ID NO: 9, 
CCTCGTTTTGG (SEQ ID NO:23). Representative, but non-llmiting spring wheat API 
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sequences are depicted in Figure 11 as SEQ ID NO: 14-17 wlierein a portion of the 
CCrCb 1 1 1 1 GG sequence is deleted or altered. An API promoter sequence Is said to 
lack all or a portion of the CCTCGTTTTGG sequence If 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or 11 
of the nucleotides of SEQ ID NO:23 are missing, absent, mutated, or subject to a 
sequence change or a deletion has been introduced in the sequence. The mutation 
may be an inversion, a reversion or other alteration In the sequence. 

Alteration of the region close the putative TATA box upstream of the CArG box 
also result in a spring growth habit in polyploid species of wheat (Fig. 11, SEQ ID 
N0:112, 113, 114, 115, 116). 

Vectors 

The promoters or the coding regions of the API and ZCCT genes of the present 
invention may be cloned into a suitable vector. Expression vectors are well known in 
the art and provide a means to transfer and express an exogenous nucleic acid 
molecule into a host cell. Thus, an expression vector contains, for example, 
transcription start and stop sites such as a TATA sequence and a poly-A signal 
sequence, as well as a translation start site such as a ribosome binding site and a stop 
codon, if not present in the coding sequence. A vector can be a cloning vector or an 
expression vector and provides a means to transfer an exogenous nucleic acid molecule 
Into a host cell, which can be a prokaryotic or eukaryotic cell. Such vectors Include 
plasmids, cosmids, phage vectors and viral vectors. Various vectors and methods for 
Introducing such vectors into a cell are described, for example, by Sambrook et al. 
1989. 

The Invention also provides an expression vector containing an API promoter 
nucleic acid molecule operably linked to a protein coding sequence. For this construct, 
the API promoter may be from any temperate grass but Is preferably from a winter 
wheat or a spring wheat. In another format, the present Invention Is directed to a 
recombinant API promoter sequence linked to an API protein. 

The invention also provides an expression vector containing a ZCCTl or ZCCTl 
derived protein coding sequence operably linked to a promoter. The promoter may be 
constitutive or inducible. 
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The invention furtfier provides an expression vector containing a ZCCTl 
promoter nucleic acid molecule operably linlced to a protein coding sequence. For this 
construct, the ZCCTl promoter may be from any temperate grass but is preferably from 
a winter wheat or a spring wheat. 

In the constructs of the invention, each component is operably linked to the 
next. For example, where the construct comprises the spring wheat API promoter, and 
protein encoding sequence, preferably, the wheat API protein, the API promoter Is 
operably linked to the 5' end of the wheat API protein encoding sequence or open 
reading frame. 

The API coding sequence may be from wheat or other API protein coding 
sequences as defined herein. The protein coding sequence linked to the API promoter 
may be an API protein sequence or another heterologous protein. The heterologous 
proteins which find use in the invention include those that provide resistance to plant 
pests, facilitate translocation of nutrients, provide resistance to stresses typical of the 
summer: heat and dehydration, etc. 

The constructs of the Invention may be introduced into transgenic plants. A 
number of recombinant vectors suitable for stable transformation of plant cells or for 
the establishment of transgenic plants have been described including those described in 
Weissbach and Weissbach (1988), and Gelvin etal. (1990). Typically, plant 
transformation vectors include one or more open reading frames (ORFs) under the 
transcriptional control of 5' and 3' regulatory sequences and a dominant selectable 
marker with 5' and 3' regulatory sequences. Dominant selectable marker genes that 
allow for the ready selection of transformants include those encoding antibiotic 
resistance genes (e.g., resistance to hygromycin, kanamycin, bleomycin, G418, 
streptomycin or spectlnomycin) and herbicide resistance genes (e.g., phosphinothricin 
acetyltransferase). 

Standard molecular biology methods, such as the polymerase chain reaction, 
restriction enzyme digestion, and/or ligation may be employed to produce these 
constructs. 
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Transgenic Plants 

Standard molecular biology methods and plant transformation techniques can be 
used to produce transgenic plants that produce plants having a recombinant API 
promoter. 

Introduction of the selected construct into plants is typically achieved using 
standard transformation techniques. The basic approach is to: (a) clone the construct 
into a transformation vector, which (b) is then introduced into plant cells by one of a 
number of techniques (e.g., electroporation, microparticle bombardment, 
Agrobacterium infection); (c) identify the transformed plant cells; (d) regenerate whole 
plants from the identified plant cells, and (d) select progeny plants containing the 
introduced construct. 

Preferably all or part of the transformation vector will stably integrate into the 
genome of the plant cell. That part of the transformation vector which integrates into 
the plant cell and which contains the introduced recombinant sequence may be referred 
to as the recombinant expression cassette. 

Selection of progeny plants containing the introduced transgene may be made 
based upon the detection of the recombinant API promoter in transgenic plants, upon 
the detection of the recombinant ZCCT-related protein coding sequence or upon 
enhanced resistance to a chemical agent (such as an antibiotic) as a result of the 
inclusion of a dominant selectable marker gene incorporated into the transformation 
vector. 

Successful examples of the modification of plant characteristics by transformation 
with cloned nucleic acid sequences are replete in the technical and scientific literature. 
Selected examples, which serve to illustrate the knowledge in this field of technology 
include: U.S. Patent No.5,571,706 ("Plant Virus Resistance Gene and Methods"); U.S. 
Patent No.5,677,175 ("Plant Pathogen Induced Proteins"); U.S. Patent No. 5,510,471 
("Chimeric Gene for the Transformation of Plants"); U.S. Patent No.5,750,386 
("Pathogen-Resistant Transgenic Plants"); U.S. Patent No.5,597,945 ("Plants Genetically 
Enhanced for Disease Resistance"); U.S. Patent No.5,589,615 ("Process for the 
Production of Transgenic Plants with Increased Nutritional Value Via the Expression of 
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Modified 2S Storage Albumins"); U.S. Patent No.5,750,871 ('Transformation and 
Foreign Gene Expression in Brassica Species"); U.S. Patent No. 5,268,526 
("Overexpression of Pliytochrome In Transgenic Plants"); U.S. Patent No. 5,780,708 
("Fertile Transgenic Corn Plants"); U.S. Patent No. 5,538,880 ("Method for Preparing 
Fertile Transgenic Corn Plants"); U.S. Patent No. 5,773,269 ("Fertile Transgenic Oat 
Plants"); U.S. Patent No. 5,736,369 ("Method for Producing Transgenic Cereal Plants"); 
U.S. Patent No. 5,610,049 ("Methods for Stable Transformation of Wheat"); U.S. Patent 
No. 6,235,529 ("Compositions and Methods for Plant Transformation and 
Regeneration") all of which are hereby incorporated by reference in their entirety. 
These examples include descriptions of transformation vector selection, transformation 
techniques and the construction of constructs designed to express an Introduced 
transgene. . 

The transgene-expressing constructs of the present invention may be usefully 
expressed in a wide range of higher plants where an altered response to vernalization Is 
useful. The invention Is expected to be particularly applicable to monocotyledonous 
cereal plants including barley, wheat, rye, triticale, oat and forage grasses. 

Methods for the transformation and regeneration of monocotyledonous plant 
cells are known, and the appropriate transformation technique will be determined by 
the practitioner. The choice of method will vary with the type of plant to be 
transformed; those skilled In the art will recognize the suitability of particular methods 
for given plant types. Suitable methods may Include, but are not limited to: 
electroporation of plant protoplasts; liposome-medlated transformation; polyethylene 
glycol (PEG-mediated transformation); transformation using viruses; micro-injection of 
plant cells; micro-projectile bombardment of plant cells; vacuum infiltration; and 
Agmbacterium-\\)&^\^\.&A transformation. Typical procedures for transforming and 
regenerating plants are described in the patent documents listed above. 

Following transformation, transformants are preferably selected using a dominant 
. selectable marker. Typically, such a marker will confer antibiotic or herbicide resistance 
on the seedlings of transformed plants, and selection of transformants can be 
accomplished by exposing the seedlings to appropriate concentrations of the antibiotic 
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or herbicide. After transformed plants are selected and grown the plant can be assayed 
for expression of recombinant proteins. 

Uses of the Transgenic Plants of the Invention 

The transgenic plants of the invention are useful In that they exhibit an altered 
response to vernalization or altered flowering time. An altered flowering time means 
that the transformed plant will flower at a different time than the untransformed plant 
or may not flower at all. As defined herein, an altered response to vernalization means 
that the transgenic plant will respond differently to vernalization than a comparable 
non-transgenic plant. In one embodiment, a transgenic winter wheat expressing a 
recombinant spring wheat API promoter operably coupled to an API polypeptide 
sequence will exhibit an altered response to vernalization in that the recombinant API 
protein will be expressed in the absence of vernalization and the plant will flower 
without the requirement of vernalization. In other words, a winter genotype would be 
transformed into a spring phenotype. Such expression contrasts with the expression of 
the endogenous (non-recombinant) API protein In the transgenic plant, which requires 
vernalization for expression. 

The protein coding sequence linked to the API promoter or ZCCTl promoter may 
be also any heterologous protein. Heterologous proteins useful in the invention include 
proteins encoded by polynucleotides from any source, natural or synthetic. Suitable 
coding regions encode animal RNAs or polypeptides, as well as variants, fragments and 
derivatives thereof. The encoded products may be recovered for use outside the host 
plant cell (e.g., therapeutically active products) or they may alter the phenotype of the 
host plant cell (e.g., conferring disease resistance, the ability to survive or grow In the 
presence of particular substrates). Examples of such coding regions Include 
polynucleotides derived from vertebrates, such as mammalian coding regions for RNAs 
(e.g., anti-sense RNAs, ribozymes, and chimeric RNAs having ribozyme structure and 
activity) or polypeptides (e.g., human polypeptide coding regions). Other coding regions 
useful in the inventive methods are derived from invertebrates (e.g., insects), plants 
(e.g., crop plants), and other life forms such as yeast, fungi and bacteria. The 
heterologous proteins which find particular use In the Invention Include those that 
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provide resistance to plant pests, facilitate translocation of nutrients, provide resistance 
to stresses typical of the summer: heat and dehydration, etc. Such protein sequences 
are available in the literature and known to those of skill in the art. Representative 
proteins of Interest are described and disclosed in Lea and Leegood (1998); Grierson 
and Covey (1991); and Buchanan et al. (2001), all of which are hereby incorporated by 
reference in their entirety. 

In another embodiment, a transgenic plant will express any protein only after 
vernalization or flowering if linked to the API promoter or only before vernalization if 
linked to the ZCCTl promoter. This could be useful to avoid the expression of the 
transgene during the vegetative growth and to direct its expression to the flowering 
period of the plant. Alternatively, this could be used to express the transgene during 
the vegetative growth phase but not during the flowering period of the plant. For 
example, a transgene could be operatively linked to the API promoter. Such a 
construct in winter wheat would only be expressed after vernalization. 

In another embodiment, flowering In wheat or other temperate grasses may be 
regulated by stimuli other than vernalization. This may be achieved by replacement of 
the endogenous API gene with an API gene operably linked to an Inducible promoter. 
Thus, expression of the API gene may be induced in response to exposure to a 
particular stimulus such as pathogen exposure, wounding, heat exposure, chemical 
exposure, etc. so that the plant will flower at a controlled time or under certain 
conditions. In addition, controlled vernalization may be achieved by addition of a ZCCT- 
related protein coding gene operably linked to an inducible promoter. Then removal of 
the stimulus that increases expression or addition of the stimulus that induces 
repression can stimulate flowering by derepression of API. In yet another embodiment, 
the expression of the API gene or the ZCCTl gene may be regulated by RNAI or 
antisense gene operably linked to an inducible promoter. 

Both delay of flowering by RNAI repression of API, and acceleration of flowering 
by RNAI repression of ZCCTl have been confirmed experimentally. In both cases the 
variation In flowering time observed in a TO plant, cosegregated with the transgene in 
the Tl progeny (Fig. 15). 
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In yet another aspect of the present invention, a plant that normaily requires 
vernaiization, such as winter wheat, may be modified to no longer require vernalization 
in order to flower. Such plants may be generated by a number of methods. In one 
embodiment, the plant may be supplied with an API promoter that is not repressed 
prior to vernalization operably linl<ed to an API gene. In another embodiment, the 
plant's endogenous ZCCTl activity may be Inhibited. The ZCCTl activity may be 
inhibited by a wide variety of methods. B<amples Include repression with RNAi (Fig. 
15A) or antisense gene expression, knockout of the ZCCTl gene or promoter, 
overexpression of a repression defective ZCCT-related protein that competes with the 
endogenous ZCCTl for the ZCCTl DNA binding site, overexpression of a DNA binding 
defective ZCCT-related protein that competes with the endogenous ZCCTl for 
associated proteins Involved In repressing API, or replacement of the endogenous 
ZCCTl protein with a defective ZCCTl protein by homologous recombination for 
example. 

In still another aspect, temperate grasses that never flower or have a long 
delayed flowering may be generated for use as forage or In situations where flowing is 
not desired such as golf courses. Such plants may be generated by expression of a 
ZCCT-related protein operably linked to a constitutive promoter. In another 
embodiment, the API activity may be permanently or greatly repressed by RNAi or 
antisense gene expression. 

Plants Produced by Plant Breeding 

Results presented here demonstrated that the allelic variation at the API gene is 
responsible for the allelic variation at the Vrnl gene from wheat. Therefore allelic 
variation at the API gene can be used as a molecular marker for the Vrnl gene In 
marker assisted selection programs. Similarly, the allelic variation at the ZCCTl gene Is 
responsible for the allelic variation at the Vrn2 gene from diploid wheat. Therefore 
allelic variation at the ZCCTl gene can be used as a molecular marker for the Vrn2 
gene in marker assisted selection programs. Marker-assisted breeding is a procedure 
well known in the art as described In Hayward, et al. (1993). 
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These markers can be used to transfer different Vrnl and/or Vrn2 alleles into 
different germplasm by marker-assisted selection. They can also be used to detiermlne 
the different haplotypes present in this region in the cultivated wheats and to establish 
a classification of the different haplotypes. This characterization will be useful to 
determine the adaptive value of the different haplotypes to different environments. 

This invention relates to the use of allelic variation at any of the genes present In 
Figure 1 as molecular markers for the Vrnl gene or In Figure 13 as molecular markers 
for the Vrn2 gene. 

This invention will be better understood by reference to the following non- 
limiting examples. 

EXAMPLE 1 

Background 

VRNl and W?/V2 (unrelated to the genes with similar names in Arabidopsis) are 
the main genes involved in the vernalization response in diploid wheat Triticum 
monococcum (Dubcovsky, J., et al. (1998), Tranquilli, G. E., etal. (1999)). (Full 
citations for the references cited herein are provided before the claims.) However, 
most of the variation in the vernalization requirement in the economically important 
polyploid species of wheat Is controlled by the VRNl locus (Tranquilli, G. E., etal. 
(1999), Law, C. N., etal. (1975)). This gene is critical In polyploid wheats for their 
adaptation to autumn sowing and divides wheat varieties into the winter and spring 
market classes. 

The VRNl gene has been mapped in colinear regions of the long arm of 
chromosomes 5A (Dubcovsky, J., et al. (1998), Law, C. N., etal. (1975), Galiba, G., et 
al. (1995)), 5B (Iwaki, K., etal. (2002), Barrett, B., etal. (2002)) and 5D (Law, C. N., et 
al. (1975)). This region of wheat chromosome 5 is colinear with a region from rice 
chromosome 3 that includes the HD-6 QTL for heading date (Kato, K., etal. (1999)). 
However, it was recently demonstrated that VRNl and HD-6 are different genes (Kato, 
K., etal. (2002)). 
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In spite of the progress made in the elucidation of the vernalization pathway In 
Arabldopsis, little progress has been made in the characterization of wheat vernalization 
genes. The two main genes involved in the vernalization pathway in Arabldopsis, FRI 
and FLC (Michaels, S. D., etal. (1999), Sheldon, C. C, etal. (2000), Johanson, U., et 
al. (2000)), have no clear homologues In the complete draft sequences of the rice, 
genome (Goff, S. A., etal. (2002)). This may not be surprising considering that rice Is a 
subtropical species that has no vernalization requirement. Since no clear orthologues of 
the Arabldopsis vernalization genes were found in rice or among the wheat or barley 
ESTs, a map based cloning project for the wheat VRNl gene was initiated. 

Chromosome walking In wheat is not a trivial exercise because of the large size 
of its genomes (5,600 Mb per haplold genome) and the abundance of repetitive 
elements (Wicker, T., etal. (2001), SanMiguel, P., etal. (2002)). To minimize the 
probability that these repetitive elements would stop the chromosome walking, 
simultaneous efforts were Initiated In the orthologous regions In rice, sorghum, and 
wheat. The Initial sequencing of rice, sorghum, and wheat BACs selected with RFLP 
marker WG644 (0.1 cM from VRNl) showed good microcollinearlty among these genera 
(SanMiguel, P., etal. (2002), Dubcovsky, J., etal. (2001), Ramakrishna, W., etal. 
(2002)). The low gene density observed in the wheat region and the large ratio of 
physical to genetic distances (SanMiguel, P., etal. (2002)) suggested that large 
mapping populations and comparative physical maps would be necessary for a 
successful positional cloning of VRNl. 

Materials and Methods 

Mapping population 
The high-density map was based on 3,095 F2 plants from the cross between T. 
monococcum ssp. aeg/lopo/des accessions G2528 (spring, VRNl) with G1777 (winter, 
vml). These two lines have the same dominant allele at the VRN2 locus and therefore, 
plants from this cross segregate only for VRNl in a clear 3:1 ratio (Dubcovsky, J., etal. 
(1998), Tranquilli, G. E., et al. (1999)). 
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Plants were grown in a greenhouse at ZO-ZS^C without vernalization and under 
long photoperlod (16-h light). Under these conditions, winter plants flowered one to 
two months later than spring plants. Fz plants ^re analyzed for molecular markers 
flanking VRNl, and progeny tests are performed for plants showing recombination 
between these markers. The 20-25 individual F3 plants from each progeny test were 
characterized with molecular markers flanking the crossover to confirm that the 
observed segregation in growth habit was determined by variation at the VRNl locus. 
G2528 and G1777 were included as controls in each progeny test. 

For studies to confirm that the CArG-box is the critical site for the recognition of 
the vernalization signal the following steps were taken. PI503874 (spring wheat with a 
single bp deletion in the CArG box SEQ 17) was crossed G3116 (winter wheat). An Fl 
plant was produced with spring flowering habits (no vernalization requirement) 
indicative of a dominant spring growth habit. Fl plants were self-pollinated to produce 
144 F2 seeds. The 144 F2 seeds were planted in cones and grown without vernalization 
in the green house under long day conditions. DNA was extracted from each of the 
plants and the promoter region was amplified by PGR. The amplified PGR fragment was 
digested with a restriction enzyme that cut only the sequence without the one base pair 
deletion. 

Procedures for genomic DNA extraction. Southern blots, and hybridizations were 
described before (Dubcovsy, J., etal. (1994)). The first 500 F2 plants were screened 
with flanking RFLP markers CDO708 and WG644, which were later replaced by closer 
PGR markers to screen the complete mapping population. Additional markers were 
developed for the eight genes present between the PGR markers as detailed below. 

Molecular markers 

Molecular markers were developed for the high-density map of the Triticum 
monococcum Vrnl vernalization gene as depicted in Figures 1-2. The information is 
organized by the order of genes in Figure 1. All primers are 5' to 3'. The Gleavage 
Amplification Polymorphic Sequence (GAPS) markers show PGR products digested with 
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the polymorphic restriction enzyme. The PCR products are detectable by gel 
electrophoresis. 

a) RFLP marker WG644 

Sequence of the WG644 showed that this RFLP marker was part of GENE4 
(putative ABC transporter gene) present in T. monococcum BAC clone 115G01 
(AF459639). This wheat RFLP marker was polymorphic between G1777 and G2528 with 
restriction enzyme Oral. 

b) GENEl (putative Mitochondrial Carrier Protein, AF459639) 
Primers 

GENEl-F CCAGCGTATGATTTGGAGGT (SEQ ID NO: 24) 
GENEl-R TTGGCATTATTGGACCTVTCA (SEQ ID NO: 25) 

Sequence of the G1777 (AY244503) and G2528 (AY244504) alleles showed a 

polymorphic Taq I restriction site. This polymorphism was used to develop a CAPS 

marker. 

c) PCSl {.Pliytociielatin synthetase) 
Primers 

PCSl-F CTGACCTGGGGCCTTGAGAG (SEQ ID NO: 26) 
PCSl-R OTCGCATCAGCAGCTCTAT (SEQ ID NO: 27) 

These primers amplified a 507 bp region of the Phytochelatin Synthetase 

pseudogene (AY188332). This wheat RFLP marker was polymorphic between G1777 

and G2528 with restriction enzyme DrA. 

d) PCS2 {Phytochelatin synthetase) 
Primers 

PCS2-F CCATGGATAATCATCGGGAG (SEQ ID NO:28) 
PCS2-R GTCACCATCACCAACTTCAA (SEQ ID NO:29) 

Primers were used to amplify a region of the Phytochelatin Synthetase 2 gene 

(Exons 3-4) from barley variety Morex (AY244504). This RFLP marker was polymorphic 

between G1777 and G2528 with restriction enzyme Eco RI. 

e) CYB5 ( Cytochrome 65) 
Primers 

CYB5-F GACTGCGTATTTGGACGACC (SEQ ID NO:30) 
CYB5-R CCACGGCTGATATCCCGACTG (SEQ ID N0:31) 
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These primers amplify a 373-bp region of Cytoclirome B5 gene (Exons 2-3) from 
T. monococcum BAG clone 609E06 (AY188332). This RFLP marker was polymorphic 
between G1777 and G2528 with restriction enzyme Eco RI. 

f) AGLGl (MADS-box) 
Primers 

AGLGl-F GACCCrCGAGAGGTACCAG (SEQ ID NO:32) 
AGLGl-R CATCTACACTACGATCTAGC (SEQ ID NO:33) 

These primers amplified exon2 and intron 2 of AGLGl from T. monococcum BAG 

719G13. Sequence of the G1777 (YA244506) and G2528 (YA244507) alleles showed 

two polymorphic Dpn II restriction sites. A cDNA from this gene (BE430753) was also 

mapped by RFLP using Eco RI to delimit the region of the crossover between AGLGl 

and CYB5. 

g) PHY-C{Phytochmme-Q 

Primers based on barley EST BE060096 

PHY-G-F GAAAATGTGTGAAGAAGCTGCT (SEQ ID NO:34) 
PHY-G-R TCTAGATGAGGAATCTGGAT (SEQ ID NO:35) 

These primers were used to amplify a 750-bp product fonn G1777 (AY244514) 

that was used as an RFLP probe to map a Hha I polymorphism. 

h) ADA2 ( Transcriptional Adaptor^ Zea mays AJ430205) 

Primers based on T. aestivum EST BJ309328 

ADA2-F GAAGATGCACTTGGAGAAGG (SEQ ID NO:36) 
ADA2-R GTCTCTTTGGATTGTACGGA (SEQ ID NO:37) 

These primers were used to amplif/ a 700-bp. product from G1777 (AY244515) 

that was used as an RFLP probe to map an /?5a I polymorphism. 

i) MTK4 {Tous/ed-ii/ce Kinase, iKCOSmi) 
Primers 

MTK4_F GGTAAAAGATGAGGAAGGAG (SEQ ID NO:38) 
MTK4-R TGTATCTATGGTGAAGTCTTACTTG (SEQ ID NO:39) 

Sequencing of G1777 (AY244512) and G2528 (AY244513) alleles with these 

primers showed a polymorphic Dpn II restriction site that was used to develop a GAPS 

marker. 

j) CDO708 {putatw/e RNA-binding protein, M:0918n) 
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The CDO708 clone was sequenced with primer Ml 3 Forward. This sequence had 
a high homology to putative RNA-bind/ng protein AAL58954.1 from rice BAG AC091811. 
This clone was used as an RFLP probe to map a Hha I polymorphism between G1777 
and G2528. 

Contig construction and BAG sequencing 

High-density filters for the BAG libraries from T. monococcum accession DV92 
(Lijavetzky, D., etal. (1999)), Oryza sat/Va war. Nipponbare (Zhang, H.-B., eta/. 
(1996)), and Sorghum b/co/or(\Noo, S. S., eta/. (1994)) were screened with segments 
from the different genes indicated In Figure 1. Gontigs were assembled using /iindWl 
fingerprinting and confirmed by hybridization of BAG ends obtained by plasmid rescue, 
inverse PGR (Woo, S. S., eta/. (1994)) or BAG sequencing. Restriction maps using 
single and double digestions with eight-cutter restriction enzymes, pulse field 
electrophoresis, and hybridization of the Southern blots with different genes, were used 
to order genes within the BACs, to select the fragment sequenced from the sorghum 
BAG, and to confirm the assembly results from the BAG sequencing. Shotgun libraries 
for BAG sequencing were constructed as described before (Ducovsky, J., eta/. (2001)). 
Gomplete T. monococcum BAGs 609E06, 719G13, and 231A16 and a 24-kb fragment 
from sorghum BAG 17E12 were sequenced. Genes were identified by a combination of 
comparative genomic analysis, BLAST searches and gene-finding programs. 

Phylogenetic analysis 

A phylogenetic study was performed using the two wheat MADS-box genes 
found in this study and 24 additional MADS-box genes (Figure 3). Phylogenetic trees 
were generated from the GlustalW sequence alignments of the complete proteins using 
multiple distance- and parsimony- based methods available in the MEGA2.1 computer 
software package (Kumar, S., eta/. (1994)). Distances between each pair of proteins 
were calculated and a tree was constructed using the Neighbor-Joining algorithm. The 
consensus tree and the confidence values for the nodes were calculated using 1000 
bootstraps (MEGA2.1). GenBank accessions used in the study were SQUAMOSA: 
H\mS (GAB97352.1), OsNPl (AAM34398.1), /yuBM8 (GAB97354.1), H\mZ 
(GAB97351.1), AlFiiL (Q38876), AOZkL (NP_564243.1), AtA?l (GAA78909.1); AGI^: 
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0!sAGLE21 (AAM34397.1), GsMADSS (AAB71434.1), GsMADSl (AAA66187.1), H\m7 
(CAB97353.1), AtPGU (AAB38975.1), /1ZAGL4 (AAA32734.1), AtAGlZ (AAA32732.1), 
>1/AGL9 (AAB67832.1), OsMADSB (AAC49817.1), H\SM9 (CAB97355.1), asMADS7 
(AAC49816.1), (9SMADS45 (AAB50180.1); OTHER: AS'LC (AAD21249.1), AtAGlS 
(AAA79328.1), /460C1 (AAG16297.1), AiAGLl (AAA32730.1), >1iAP3 (AAA32740.1). 
RT-PCR and Quantitative PGR 

RNA from leaves, undifferentiated apices, and young spikes was extracted using 
the TRIZOL metliod (INVITROGEN). RT-PCR procedures were performed as described 
before (Yan, L., eta/. (2002)). Quantitative PGR experiments were performed in an 
ABI7700 using three Taql^an® systems for T. monococcum API and for >1C77/Vand 
UBIQUrUN^s endogenous controls. RT-PCR and Quantitative PCR experiments 
fovTriticum monococcum API gene. All probes and primers are indicated In 5' to 3' 
orientation. RT-PCR reactions were performed using Superscript II (Invitrogen ®) and 
primed witii oligo(dT)i2-i8. 

The 2"^S- method (Uval<, K. J., etal. (2001)) was used to normalize and 
calibrate the Ct values of wheat API relative to the endogenous controls. For the 
vernalization time course, RNA was extracted from the youngest fully expanded leaf of 
five winter T. monococcum plants (1 month old) immediately before moving the plants 
into the cold room, and then after 2, 4, and 6 weeks of vernalization (4°C). The last 
sample was collected two weeks after moving plants to the greenhouse (20''C). Plants 
kept in the greenhouse were sampled as controls at each time point simultaneously 
with the plants from the cold room (5 plants per time point), 
a) RT-PCR 

RT-PCR reactions were performed using superscript II (Invitrogen ®) and primed 
with ollgol (*"") 12-18. AM probes and primers are Indicated In the 5' to 3' orientation. 
API 

The Left primer, Exon 3 was GGAAACTGGTGTCACGAATA (SEQ ID NO:40). The 
Right primer 5' UTR was CAAGGGGTCAGGCGTGCTAG (SEQ ID N0:41) 

The cDNA product: 571-bp and the Genomic DNA product was 1262-bp. The 
/l/'i -specificity of the 5'UTR primer was confirmed by sequencing the PCR amplification 
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products. Hybridization of the PCR product with Southern blots of T. monococcum 
indicated that API was a single copy gene In T. monococcum. 
AGLGl 

The Left primer Exon 2 was GAGGATTTGGCTCCACTGAG (SEQ ID NO:42). The 
Right primer. Exon 7 outside K-box was TCTAGGGCCTGGAAGAAGTG (SEQ ID NO:43). 
The cDNA product was 302-bp in length. The Genomic DNA product was 901- 

bp. 

The y4(jZ.(7i-speclficity of these primers was confirmed by sequencing the PCR 
amplification products. Hybridization of the PCR product with Southern blots of T. 
monococcum indicated that AGLGl was a single copy gene in T. monococcum. 

ACTIN 

The Left primer, Exon 3 was ATGTGGATATCAGGAAGGA (SEQ ID N0:44). The 
Right primer, Exon 3 was CTCATACGGTCAGCAATAC (SEQ ID NO:45) 
The cDNA product: 85-bp 

b) Quantitative PCR 

Tests for amplification efficiency were performed. Six 2-fold dilutions were 
tested in triplicate; 1:1, 1:2, 1:4, 1:8, 1:16, 1:32. Standard curves were plotted with ng 
RNA on the X-axis and aCt on the y-axis. The slope and the differences in slopes with 
the IBS standard curve were determined. The criteria for passed test was set as the 
differences of slopes being < 0.1. The calculation of the efficiency based on the slope 
was also plotted. 

ACTIN TaqMan System 

The Left primer was: ATGGAAGCTGCTGGAATCCAT (SEQ ID NO:46). The Probe 
(reverse orientation) was CCTTCCTGATATCCACATCACACTTCATGATAGAGT (SEQ ID 
NO:47) 

The Right primer is: CCITGCTCATACGGTCAGCAATAC (SEQ ID NO:48) 

The sequence of Actin exon 3 Is: 
GAGAAGAGCrATGAGCrGCCTGATGGGCAGGTGATCACCATTGGGGCAGAGAGGTTCCGTTG 
CCCTGAGGTCCTTTTCCAGCCATCTTTCATTGGTATGGAAGCTGCTGGAATCCATGAGACCAC 
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CTACMCTCTATCATGMGTGTGATGTGGATATCAGGMGGATCTGTATGGTAACATCGTGCT 
CAGTGGTGGCTCAACTATGTTCCCGGGTATTGCTGACCGTATGAGCAAGGAGATCACTGCCCT 
TGCACCAAGCAGCATGAAGATCAAGGTGGTGGCACCGCCTGAGAGGAAGTACAGTGTCTGGA 
mGAGGGTCGATrCTTGCCTCCCTTAGTACCTTCCAACAG (SEQ ID NO:49) 

The differences of the slopes with 18S was determined to be 0.0352. The actin 
system passed the efficiency test with an efficiency of 99.1. 
API TaqMan System 

The Left primer was: AACTCAGCCTCAAACCAGCTCTT (SEQ ID NO:50). The 
Probe (reverse orientation) was CATGCTGAGGGATGCTCCCCCTG (SEQ ID N0:51). The 
Right primer was CTGGATGAATGCTGGTATTTGC (SEQ ID NO:52). 

The API T. monococcum sequence is: 
CTCGTGGAGAAGCAGAAGGCCCATGCGGCGaVGCAAGATCAAACTCAGCCTCAAACCAGCTCT 
TCIICI ICI ICCTTCATGCTGAGGGATGCTCCCCCTGCCGCAAATACCAGCATTCATCCAGCGG 
CGGCAGGCGAGAGGGCAGAGGATGCGGCAGTGCAGCCGCAGGCCCCACCCCGGACGGGGCTT 
CCACCGTGGATGGTGAGCCACATCAACGGGTGA (SEQ ID NO:53) 

The differences of the slopes with 18S was determined to be 0.0056. The actin 
system passed the efficiency test with an efficiency of 96.3. 

UBIQUITINJaqMan System 

The Left primer was: ATGCAGATCTTTGTGAAGACCCTTAC (SEQ ID NO: 54). The 
Probe was: CAAGACCATCACTCTGGAGGTTGAGAGCTC (SEQ ID NO:55). The Right 
primer GTCCTGGATCTTGGCCTTGA (SEQ ID NO:56) 

The sequence of Ubiquitin is: 
ATGCAGATCTTTGTGAAGACCCTTACTGGCAAGACCATCACTCTGGAGGTTGAGAGCTCAGAC 
ACCATCGACAATGTCAAGGCCAAGATCCAGGACAAGGAGGGCATCCCCCCGGACCAGCAGCGC 
CTCATOTCGC:AGGAAAGCAGCTGGAGGATGGCCGCA(n'CTrGCTGACTA(:AACATCOVGM 
GAGTCCA(TCTrCACCTTGTCCTGCGTCTTCGTGGCGGT (SEQ ID NO: 57) 

The differences of the slopes with 18S was determined to be 0.0292. The actin 
system passed the efficiency test with an efficiency of 99.4. 
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Additional deletions in the Promoter Region of API 

PCR primers for the promoter region flanl<lng tlie 20-bp deletion present in the 
spring genotype G2528 were used to screen a collection of 65 accessions of cultivated 
T. monococcum ssp. monococcum. None of the winter accessions showed deletions in 
this region. Among the accessions with spring growth habit, three (PI-349049, 
PI326317, and PI 418582) showed a 34-bp deletion, one (PI-355515) showed a 48-bp 
deletion and one showed a 1 nucleotide change in the CArG box (Figure 11). 

The Primers used to screen the T. monococcum collection were: 
APl_ProDel_Fl\ ACAGCGGCTATGCTCCAG (SEQ ID NO:58) and APl_ProDel_Rl\ 
TATCAGGTGGTTGGGTGAGG (SEQ ID NO:59). The expected size without deletion Is 
152 bp 

Deletions are Illustrated in Figure 11. Accessions carrying the new deletions can 
be crossed with winter T. monococcum ss^. toeticum G3116 to determine the linkage 
between these deletions and growth habit. A detailed sequence analysis of the allelic 
variation at the Vrnl and Vm2 loci In this collection can be prepared by procedures 
available to those of skill in the art. 

The CArG-box was confirmed as a critical site for the recognition of the 
vernalization signal. The sequence of spring Triticum monococcum accession number 
PI503874 showed the presence of a 1-pb deletion In the CArG-box of the promoter of 
the API gene. The normal CArG box is CCCTCbl II IGG and the sequence In PI503874 
was CCCT-b 1 1 1 1 GG. The sequence of the promoter region T. monococcum PI 503874 
is provided in Figure 11 (SEQ ID No: 17). This one base pair deletion was the only 
observed difference with the promoters from other T. monococcum accessions with 
winter growth habit. Thirty-four plants were winter and 110 were spring. All the spring 
plants were homozygous or heterozygous for the presence of the one-base pair 
deletion, whereas the 34 winter plants were all homozygous for the absence of the one- 
bp deletion. This confirms complete linkage between the 1-bp deletion and the spring 
growth habit. 
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Marker development 

In the initial genetic map (Dubcovsky, J., etal. (1998)) the VRNl gene was 
flanked in the distal side by WG644 and in the proximal side by CDO708. These markers 
were used as anchor points to the rice genome sequence to find additional markers. 

Distal region: WG644 was previously used to identify rice BAG 3615 that included 
GENEl at its proximal end (Dubcovsky, J., 5/(2001). BLASTN searches of the 
different rice genome projects using GENEl and the end sequence of BAG 3615 
(AY013245) identified the connected contig GL013482.168 (Goff, S. A., etal. (2002)). 
Two additional genes, Phytochelatin synthetase {PCS, Zea /77a/sAAF24189.1) and 
Cytochrome BSiCYBS, NP_1 73958.1), were discovered and annotated in this new 
contig. These genes were mapped in wheat by RFLP (Rgs. 1 and 2A-2B). 

Proximal region: RFLP marker GDO708 was mapped 0.9 cM proximal to VRNl In 
the T. monococcum map. The sequence of this clone showed a high homology to a 
putative RNA-binding protein (AAL58954.1) located in rice BAG AG091811. The end of 
this rice BAG also included gene A/77r4 (putative protein kinase tousled, AAL58952.1) 
that was converted Into a PGR marker and was mapped In wheat (Fig. 1). Rice BAG 
sequence AG091811 was then connected through contigs GL039395.93, GL039395.83, 
and GL018222.111.1 (Goff, S. A., etal. (2002)) to rice BAG sequences AG092556 and 
AF377947. BAG sequence AG092556 included a Transcriptional Adaptor gene {ADA2, 
AJ430205) that was mapped In T. monococcum 0.5 cM from the VRNl gene (Fig.l). 
The last rice BAG sequence AF377947 Included genes Phytochrome-C{PHY-C, 
AAM34402.1), Cysteine proteinase {CYS, AAM34401.1) and MADS-box genes 
AAM34398.1 and AAM34397.1, designated hereafter API and AGLGl (>1<yi-like gene 
from Gl-asses). The rice proximal region Included 318-kb of contiguous sequence. 

High-density genetic maps of tiie VRNl region 

The PGR markers developed for GENEl and M7K4\Nere used to screen 6,190 
chromosomes for recombination. Fifty-one recombinant events were detected, and 
those plants were further characterized using molecular markers for all the genes 
present between these two markers in rice (Fig. 1 and Figs. 2A-2B). Progeny tests were 
performed for 30 of the 51 F2 plants, to determine the VRNl genotype of the parental 
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F2 plants. Based on the mapping information, the VRNl locus was completely linked to 
API and AGLGl. 

On the proximal side, genes PHY-C and CK5 flanked the VRNl locus. The last 
two genes were completely linked to each other and separated from VRNl by a single 
crossover (Fig. 1, Figs. 2A-2B). On the distal side, the CYBSgene was also separated 
from VRNl by a single crossover. Comparison of genotypic and phenotypic data from all 
the F3 plants used in the 30 progeny tests confirmed that the observed segregation in 
growth habit was determined by variation at the VRNl locus. Unvernalized plants 
homozygous for the G1777 API allele flowered 1-2 months later than G2528 whereas 
the other plants flowered only one week before or after the G2528 control. These 
results confirmed the simple Mendelian segregation for vernalization requirement in this 
cross (Dubcovsky, J., eta/. (1998)). 

Physical maps 

Distal contig: Genes CKfi5 and GENEl were used to screen the BAC libraries from 
T. monococcum, rice and sorghum. Tritlcum monococcum BAC clone 609E6 selected 
with the OHJgene was connected to previously sequenced 116F2 (AF459639) by four 
BACs (Fig. 1). The PCSqene hybridized with two fragments from BAC 609E06 {PCSl 
and PCS2, Fig. 1) whereas only one PCS copy was found in the colinear region in rice. 
No single copy probes were found in BACs 609E6 or in the unique Hind III fragments 
from the most proximal BAC 393011 to continue the chromosome walking towards the 
proximal region. 

Proximal contigr. Screening of the T. monococcum BNZ library with PHY-Q CYS, 
API, and AGLGl yielded twelve BACs organized in two contigs. The largest contig 
included eight BACs that hybridized with genes PHY-Q CYS, and API. The four 
additional BACs hybridized only with the AGLGl gene (Rg. 1). The location of the 
AGLGl contig within the physical map was determined by the complete linkage between 
the single copy genes AGLGl and API and the proximal location of AGLGl relative to 
single copy gene CYB5. No additional single copy probes were found to close the gaps 
flanking the AGLGl contig. 
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The proximal gap between AGLGl and API was covered by the current rice 
sequence. However, the distal gap between CYfiS" and AGLGl was also present in the 
different rice genonfie sequencing projects. The screening of the NIpponbare BAC 
libraries with probe failed to extend the rice region because of the presence of a 
gap in the current rice physical maps. Fortunately, sorghum BAC 17E12 included 
GENEl, PCSl, PCS2, and CKH5" genes from the distal contig, and AGLGl, API, and the 
CK5 genes from the proximal contig, bridging the gap present in the rice and wheat 
contigs (Fig. 1). A restriction map of sorghum BAC 17E12 (Fig. 6) indicated that the 
sorghum genes were in the same order as previously found in rice and wheat and that 
a 24-kb Swa \-Swa I restriction fragment spanned the region of the rice and wheat gap 
between CKfiJand AGLGl. 

Sequence analysis 

Annotated sequences from the three T. monococcum BACs (AY188331, 
AY188332, AY188333) and the partial sequence of the sorghum BAC 17E12 (AY188330) 
were deposited in GenBanl<. Including BACs 115G01 and 116F02 (AF459639) a total of 
550-kb were sequenced. Multiple retrotransposons organized in up to four layers of 
nested elements were the most abundant features, similar to wheat regions analyzed 
before (Wicker, T., etaL (2001), SanMiguel, P., etal. (2002)). Retrotransposons and 
other repetitive elements accounted for 78.4% of the annotated sequence whereas 
genes represented only 8.5% of the total. The genes detected in this sequence were in 
the same order as the ones present in the corresponding regions in rice and sorghum, 
indicating an almost perfect microcollinearity. The only exception was the duplication of 
the PCS' gene in sorghum and wheat relative to the presence of a single /'CSgene in 
the colinear rice region (Fig. 1). 

No additional genes were found in the rice sequence between the two MADS-box 
genes corresponding to one of the two gaps in the wheat physical map. These two 
genes were also adjacent in sorghum (Fig. 1, and Fig. 6). Similarly, no new genes were 
found between 0^5" and AGLGl in the sequence of the 24-kb Swa \-Swa I restriction 
fragment from sorghum BAC 17E12 (AY188330) that covered the other gap in the 
wheat physical map. The four genes present in the sorghum sequence were In the 
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same order and orientation as previously found in rice and wheat (Fig. 6). Figure 6 
sliows: 1) No additional genes were detected between CYBSand AGLGl. 2) No similar 

sequences were detected between the intergenic regions in sorghum and the colinear 
sequences in rice or wheat. 3) Genes In sorghum were in the same order as in wheat 
and rice. 

The absence of new genes In the colinear regions of rice and sorghum, together 
with the excellent microcolllnearlty detected In this region, suggested that It would be 
unlikely to find additional genes in the current gaps of the wheat physical map. This 
assumption was also supported by the absence of any new gene In the 324-kb of wheat 
sequence flanking these gaps. The presence of almost uninterrupted series of nested 
retrotransposons flanking the gaps also explained the failure to find single copy probes 
to close the two gaps. 

Classification of the two MADS-box genes 

The API and AGLGl proteins have MADS-box and K domains characteristic of 
homeotic genes Involved in the flowering process and similar exon structure (Fig. 1, Fig. 
6) (Ng, M., etal. (2001)). The consensus tree for 26 plant MADS-box proteins (Fig. 3) 
showed that the closest proteins to wheat API and AGLGl belonged to the SQUAMOSA 
(bootstrap 97) and AGL2 groups (bootstrap 95) respectively. 

The closest Arabidopsis MADS-box proteins to wheat API were the proteins 
coded by the three related meristem Identity genes API, CAL and FUL (Figure 3). Two 
separate clusters were observed In the SQUAMOSA group dividing the Arabidopsis and 
grass proteins. A similar separation between the monocot and dicot proteins was found 
in more detailed studies of this group (Johansen, B., eta/. (2002)). The API protein 
from T. monococcumvias 98.4% similar to previously described T. aestivum WAPl, 
formerly TaMADS#ll (Mural, K., etaL (1997), Mural, K., etaL (2002)), and 96.0% 
similar to barley BM5 (Schmitz, J., etal. (2000)). These two putative orthologous 
genes were described in papers characterizing the MADS-box family in wheat and 
barley, but were not mapped or associated with the VRNl gene. 
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The wheat AGLGl protein was clustered with members of the AGI-2 subgroup 
and was closely related with the rice AGLGl orthologue and with rice OsMADSS, 
OsMADSl and barley proteins (bootstrap 87, Fig. 3). 

Expression profiles 

No API transcripts were detected in apices from unvernalized plants of T. 
monococcumyNxth strong winter growth habit (G3116) even after ten months In the 
greenhouse under long day conditions. However, API transcription was detected in the 
apices of plants from the same genotype after six weeks of vernalization (Fig. 4A, lanes 
6 and 7). The same result was obtained in three independent experiments. These 
apices were morphologically at vegetative stage zero according to the developmental 
scale of Gardner et al. (1985). In T. monococcum accessions with spring growth habit, 
API transcripts were observed in the apices without the need of previous vernalization. 

Developmental stage of the apexes used in the RT-PCR experiment 

After six weeks of vernalization the shoot apexes did not show any morphological 
sign of differentiation from the vegetative shoot apex stage as observed before 
vernalization. An apex from winter T. monococcum accession G3116 after six weeks of 
vernalization was visualized. The results showed that the expression of Apl in the 
apices precedes the differentiation of the apex. 

Transcripts of API were also detected in the leaves, as previously reported for 
mPl (Mural, K., eta/. (1997)) and BM5{Schm\tz, J., (2000)). A quantitative PGR 
experiment using the endogenous controls ACTJN and UBIQUITIN demonstrated that 
transcription of API in the leaves of the winter genotypes was also regulated by 
vernalization. 

Effect of vernalization on ^ CTIN and UBIQUITIN transcription levels 
Average threshold cycle (Cx ) 





n 


Cold room Ct 


n 


Greenhouse Cj 


ANOVA 


A CTIN 


15 


17.4 


30 


17.5 




UBIQUITIN 


14 


15.1 


29 


15.3 


P=0.55 



No significant differences were detected between plants in the greenhouse and 
plants in the cold room in the Ct values of /lC77/Vand UBIQUITIN. The abundance of 
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API transcripts started to increase after the first two weeks of vernalization and 
continue increasing during the four additional weeks of the vernalization process (Fig. 
4B). 

The API transcription levels relative to ubiquitin are presented in Figure 5. 
Samples were extracted from the emerging and first fully expanded leaves of Triticum 
monococcum G3116 (winter growth habit) 1) Before vernalization, 2) 2 weeks in the 
cold room, 3) 4 weeks In the cold room, 4) 6 weeks In the cold room, 5) two weeks 
after the vernalized plants were returned to the greenhouse. Units are linearized values 
using the 2^ '^'^^ method, where CT is the threshold cycle. The results show that API 
transcripts were also present in the leaves from vernalized plants two weeks after their 
transfer to the greenhouse. 

Control plants kept In the greenhouse showed very low level of API transcription 
during the eight weeks of the vernalization experiment (Fig. 48). In the genotypes with 
a spring growth habit, API transcripts were observed In the leaves of unvernalized 
plants that were Initiating the transition to flowering. 

API transcription levels In leaves of different ages are shown in Figure 5. One- 
month-old G3116 plants were vernalized for six weeks and then transferred to the 
greenhouse for two weeks under long day conditions. RNA was extracted from each of 
the five green leaves from the main stem and one secondary tiller from two plants. In 
Fig. 5, number 1 Indicates the youngest leaf (not fully emerged from the sheath) and 
number 5 the oldest green leaf. Bars represent standard errors of the means. Ubiquitin 
was used as an Internal control. Units are linearized values using the 2^ '^^ method, 
where CT is the threshold cycle. The results show that API transcripts were detected In 
young and old green leaves. 

A relatively high level of expression of Apl was observed in all the leaves. 
Average Ct values for Apl (24.1) were only two cycles higher than for Ubiquitin (22.0). 
This result confirmed that Apl induction by vernalization was not restricted to the 
youngest leaves. Marginally significant differences (ANOVA, P= 0.05) were observed 
between leaves of different ages, with the highest value for leaf 1 
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AGLGl transcripts were detected only in young spikes (Fig. 4A, lane 8), but were 
not observed in the same cDNA samples from apices after six weel<s of vernalization 
wliere the API transcripts were already present (Fig. 4A). This Indicates that AGLGl 
transcription is initiated later than API. Transcripts from AGLGl were not detected in 
the leaves (Fig. 4A). 

The expression results together with the known role of the API homologues in 
Arabidopsis as meristem identity genes, suggested that API was a better candidate 
gene for VRNl than AGLGl. 

Allelic variation 

Four API genes were sequenced from T. monococcum accessions G1777, G3116, 
and DV92 carrying the vrnl allele and G2528 carrying the Vrnl allele. The nucleotide 
sequences for G2528 and DV92 are presented In Figure 7. The predicted proteins from 
DV92 and G2528 were identical and differed from the predicted proteins from G3116 
and G1777 by a single amino acid (Figure 8). 

Analysis of the 1024-bp region upstream from the API start-codon and up to the 
insertion point of a large repetitive element (AY188331) showed the presence of five 
polymorphic sites. Two of them differentiated G2528 from the three accessions carr/ing 
the vrnl allele for winter growth habit. One was a one bp insertion located 728-bp 
upstream from the start codon and the other one was a 20-bp deletion located 176-bp 
upstream from the start codon (Fig. 1, Figures 9A-9B, 11). No difference were detected 
in the first 600-bp of the API 3' region between the vrnl and Vrnl alleles. 

A PGR screening of a collection of cultivated T. monococcum accessions with 
primers flanking the 20-bp deletion region revealed the presence of deletions of 
different sizes in agarose gels (Figure 10). Sequencing of these lines showed the 
presence of two new deletions of 34-bp and 48-bp that overiapped with the 20-bp 
deletion from G2528. these new deletions included a putative MADS-box protein- 
binding site adjacent to the 20-bp deletion (Fig. 1, Figure 11). A 1-bp deletion within 
the CArG box (Fig. 11) was found in 7 T. monococcum accessions with spring growth 
habit. We confirmed by genetic analysis of segregating F2 populations that the 34-bp 
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and 1-bp deletion were completely linked with the spring growth habit. New crosses 
have been made to analyze the 48-bp deletion. 

No DNA differences were detected between accessions DV92 (vrnl) and G2528 
( Vrnl) In the coding region, or the 5' (365-bp) and 3' (583-bp) untranslated regions of 
the AGLGl gene. 

Discussion 

Genetic and pliysical maps of tlie VRNl region 

Only eight genes were found in the 556-kb of sequence from the T. monococcum 
VRNl region, resulting in an estimated gene density of one gene per 70-kb. The low 
gene density observed in this region was paralleled by a high ratio between physical 
and genetic distances. Excluding the two gaps in the physical map, a minimum ratio of 
6,250-kb cM"^ was estimated for the region between WG644 and PHY-C. This value is 
two times larger than the average genome-wide estimate of 3,000-kb cM'^ (Bennett, M. 
D., etal. (1991)) and four times larger than the 1,400-kb cM'^ reported for the 
telomeric region of chromosome lA (Stein, N., etal. (2000)). Previous cytogenetic 
studies demonstrated that recombination in the wheat chromosomes decrease 
exponentially with distance from the telomere (Dvorak, J., etal. (1984), Lukaszewski, 
A. J,, etal. (1993)), predicting an increase of the ratio between physical and genetic 
distance In the same direction. The region studied here Is located between the 
breakpoints in deletion lines 5AL-6 (FL 0.68) and 5AL-17 (FL 0.78), In a more proximal 
location than regions used before to estimate ratios between physical and genetic 
distances in wheat. This result suggests that positional cloning projects in the proximal 
regions of wheat will be difficult and would greatly benefit from the use of the rice 
genomic sequence to jump over large blocks of repetitive elements. 

In spite of the low recombination rate found in this region, the large number of 
evaluated gametes was sufficient to find crossovers between most of the genes or at 
least between pairs of adjacent genes. This detailed genetic study showed that the 
variation in growth habit determined by the VRNl gene was completely linked to only 
two genes. Although the possibility that additional genes would be found in the two 
current gaps and unsequenced regions of our T. monococcum physical maps cannot be 
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ruled out, this seems unlikely based on the comparative studies with rice and sorghum 
and the absence of any additional genes in the 324-kb of wheat sequence between 
CYBSandCrS. 

The genetic data reduced the problem of the identification of VRNl to the 
question of which of the two MADS-box genes was the correct candidate. However, 
since no recombination was found between AGLGl and API it was not possible to 
answer this question based on the available genetic results. Therefore, the relationship 
between AGLGl and API with MADS-box genes from other species was established as a 
first step to predict their function from the known function of the related genes. 
Phylogenetic relationships of tlie VRNl candidate genes 
The similarity between the wheat API gene and the Arabidopsis meristem 
identity genes API, CAL, and FUL provided a first Indication that the wheat API gene 
was a good candidate for VRNl. These Arabidopsis genes are expressed in the apices 
and are required for the transition between the vegetative and reproductive phases 
(Ferrandiz, C, etal. (2000)). The triple Arabidopsis mutant apl-cal-ful r\&^Qr flowers 
under standard growing conditions. In wheat, the VRNl gene Is also responsible, 
directly or indirectly, for the transition between vegetative and reproductive apices. This 
transition is greatly accelerated by vernalization in the wheat plants carrying the vrnl 
allele for winter growth habit. Therefore, it is reasonable to speculate that the sequence 
similarity between the wheat API gene and the Arabidopsis meristem Identity genes 
may indicate similar functions. An evolutionary change In the promoter region of API 
may be sufficient to explain the regulation of API by vernalization In wheat (see model 
below). 

The close relationship of wheat AGLGl to members of the AGL2 subgroup 
suggested that AGLGl was a less likely candidate for VRNl than API because 
transcripts from genes Included In this group are usually not observed In the apices in 
the vegetative phase (Johansen, B., etaL (2002)). Expression of Arabidopsis AGL2, 
AGL4ar\6 AGL9 begins after the onset of expression of floral meristem identity genes 
but before the activation of floral organ Identity genes suggesting that members of the 
AGI-2 clade may act as intermediaries between the meristem Identity genes and the 
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organ identity genes (Flanagan, C. A., etal. (1994), Savidge, B., etal. (1995), Mandel, 
1^. A., etal. (1998)). Tliis seems to be valid also for OsMADSl, which is more closely 
related to AGLGl than the Arabldopsis members of the AGL2 clade. In situ hybridization 
experiments of young rice inflorescences with OsMADSl, showed strong hybridization 
signals in flower primordia but not in other tissues (Chung, Y. Y., etal. (1994)). 

If the functions of wheat APtan6 AGLGl were similar to the function of the 
related genes from Arabidopsis, the initiation of transcription of API should precede the 
initiation of transcription of AGLGl In wheat. 

Transcription profiles of the VRNl candidate genes 

RT-PCR experiments using FU^A samples from vernalized apices showed 
transcription of API but not of AGLGl (Fig. 4A) indicating that transcription of AGLGl 
occurs after the initiation of transcription of API. The similar timing and order of 
transcription suggests that the wheat genes might perform similar functions to the 
related Arabidopsis genes. 

It could be argued that any gene in the flowering regulatory pathway would be 
up regulated by the initiation of flowering caused by the vernalization process. 
However, the up regulation of API transcription in the leaves by vernalization (Fig. 4B) 
indicated a more direct role of the vernalization pathway in the regulation of wheat API 
gene. Four additional characteristics of the transcription profile of API paralleled the 
predicted expression of a vernalization gene. First, vernalization was required to Initiate 
API transcription in the plants with winter growth habit but not in the plants with spring 
growth habit. Second, API transcription was initiated only after two weeks In the cold 
room, and a minimum of two weeks of vernalization is required by many winter wheat 
varieties to produce any significant acceleration of flowering (Limin, A. E., etal. 
(2002)). Third, the progressive increase of API transcripts after the second week of 
vernalization (Fig. 48) is consistent with the progressive effect of the length of the 
vernalization period In the acceleration of flowering time. Finally, a high level of API 
transcripts was observed after the plants were moved from the cold to room 
temperature indicating that API is not just a cold stress induced gene. 
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Allelic variation 

No differences were found in the AGLGl coding region or in its 5' and 3' regions 
between T. monococcum accessions G252S (Vrnl) and DV92 {vrnl) confirming tliat 
AGLGl was not a good candidate to explain tlie observed differences in growth liabit. 

Altliough no differences were detected in tlie API coding sequences and 3' 
region, the spring and winter accessions differed in their promoter sequence. The first 
600-bp upstream from the start codon were identical among the four genotypes 
analyzed in this study except for a 20-bp deletion located close to the start of 
transcription and adjacent to a putative I^ADS-box protein binding site (CArG-box) in 
G2528 (Tilly, J. J., eta/. (1998)) (Figure 4). Two additional overlapping deletions were 
discovered in the same region of the promoter in spring accessions of cultivated T. 
monococcum {Figure 11) and a 1-bp deletion was found by sequencing accession 
PI503874 (SEQ.ID NO: 17). The presence of a putative CArG-box in this region 
suggests the possibility that a trans-acting factor may bind to this site and repress API 
transcription until vernalization occurs. This is similar to the case for FLC'm Arabidopsis, 
which was recently shown to bind to I^ADS-box gene SOCl and repress its transcription 
prior to vernalization (Hepworth, S. R., eta/. (2002)). 

A model for the regulation of flowering by vernalization in wheat 

The results presented in this study can be included in an integrated model (Fig. 
12) based on the known epistatic interactions between l/7?/Vi and W?/V2(Tranquilli, G. 
E., eta/. (1999)) and the available information about the evolution of the vernalization 
requirement in the Triticeae. The significant epistatic interactions observed between 
VR/\/l and VR/\/2 Indicate that these two genes act in the same pathway. According to 
the model presented here (Fig. 12), l//?/V2 codes for a dominant repressor of flowering 
that acts directly or indirectly to repress VRNL As the vernalization process reduces the 
abundance of the \/R/\/2gene product, VRA/l transcription gradually increases leading to 
the competence to flower (Fig. 13, center). 

The growth habit of plants homozygous for the recessive vm2a\\e\e for spring 
growth habit (Fig. 12, upper panel) Is independent of variation at the VR/\/l locus. 
According to this model, the 1^/772 allele represents a null or defective repressor that 
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cannot interact with tlie VRNl promoter. Tlierefore, variation in tfie promoter of tlie 
VRNl gene would have no effect on flowering time in homozygous vm2 plants. This can 
be illustrated by the expression pattern of API in T. monococcum DV92 {vrnl vrn2). In 
this genotype, the initiation of API transcription in leaves and apices did not require 
vernalization in spite of the presence of a recessive vrnl allele. This result indicated that 
the VRNl gene acts downstream of l//?/V2(Fig. 12). 

Conversely, plants homozygous for the Vrnl allele for spring growth habit 
showed no significant effects of the VRNZqeuQ on flowering time. According to the 
model in Fig. 12 (lower panel), the VRN2 repressor will have no effect on flowering in 
genotypes carrying the Vrnl allele because of the lack of the recognition site in the 
VRNl promoter region. This part of the model can be used to explain the API 
expression profile of G2528 ( Vrnl Vrnl). In this genotype, transcription of API in leaves 
and apices is Initiated without a requirement for vernalization in spite of the presence of 
an active VRN2 repressor. This suggested that the active repressor could not interact 
with the G2528 API promoter region, possibly because of the presence of the 20-bp 
deletion. 

This model also provides an explanation for the parallel evolution of VRNl spring 
alleles in three different Triticeae lineages. A vernalization gene with a dominant spring 
growth habit has been mapped in the same map location in diploid wheat (Dubcovsky, 
J., eta/. (1998)), barley (Laurie, D. A., eta/. (1995)), and rye (Plaschke, J., eta/. 
(1993)). Most of the wild Triticeae have a winter growth habit suggesting that the 
recessive vrnl allele is the ancestral character (Kihara, H., eta/. (1958), Halloran, G. 
M., eta/. (1967), Goncharov, N. P., eta/. (1998)). This is also supported by the fact 
that it is unlikely that a vernalization requirement would be developed independently at 
the same locus in the three different lineages from an ancestral spring genotype. 
According to the model presented here. Independent mutations In the promoter regions 
of winter wheat, barley, and rye genotypes have resulted in the loss of the recognition 
site of the VRN2 repressor (or an intermediate gene) and therefore. In a dominant 
spring growth habit ( Vrnl allele). Since this is a loss rather of a gain of a new function 
it Is easier to explain its recun-ent occurrence in the different Triticeae lineages. 
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In summary, this invention presents the delimitation of the candidate genes for 
Vrnl to API and AGLGl by a high-density genetic map, and the identification of API as 
the most likely candidate based on Its similar sequence to meristem identity genes, its 
transcription profile, and its natural allelic variation. The model is presented to integrate 
the results from this study with the previous knowledge about the epistatic interactions 
between vernalization genes and the evolution of vernalization in the Triticeae. 

EXAMPLE 2 

Introduction 

Genes controlling vernalization requirement prevent flower development during 
the cold months of winter, providing protection for the environmentally sensitive floral 
organs. The proper timing of the transition from the vegetative to the reproductive 
stage is critical to the reproductive success of a species and is under the regulation of a 
complex gene network (G. G. Simpson etal. (2002), A. Mouradov etal. (2002)). 

The vernalization pathway is an important part of this regulatory network, and 
has been studied with great detail in Arabidopsis. The FZ.Cgene plays a central role in 
this pathway by integrating the signals from the extended cold treatment with signals 
from the autonomous flowering pathway (S. D. Michaels, etal. (1999), C. C. Sheldon, 
etal. (1999)). A high level of expression is required to maintain a vegetative status. 
Another important gene in the Arabidopsis vernalization pathway is FRI^ which 
upregulates FlCtranscription (S. D. Michaels, etal. (1999), U. Johanson etal. (2000)). 
Vernalization produces the opposite effect, and results in the permanent downregulated 
of FLC{S. D. Michaels, etal. (1999), C. C. Sheldon, etal. (1999)). Two genes, recently 
designated VRNl and VRN2bvq required to keep FLC\r\ its repressed status, but not for 
Its initial repression by cold (A. R. Gendal etal. (2002)). We suggest renaming the 
Arabidopsis genes as VRNf^Bu6 VRN2^\o avoid confusion with the main vernalization 
loci in wheat, VRNl and VRN2, which correspond to different genes (See Example 1) 
and were assigned these names before (R. A. Mcintosh, etal. (1998)). The signals from 
the vernalization pathway converge with those from the photoperiod pathways at the 
regulatory regions of the SOCl and /Tgenes (G. G. Simpson, etal. (2002), A. 
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Mouradov, etal. (2002)). FZ.C binds to the promoter of SOCl and impairs its activation 
by C(9(S. R. Hepwortfi, etal. (2002)), a central gene in the photoperiod pathway (P. 
Suarez-Lopez, etal. (2001)). CO activates SOCiand /T(S. R. Hepworth, etal. (2002)), 
which then interact with other genes to induce the meristem identity gene API, 
initiating the transition between the vegetative and reproductive apex. 

CO has different functions in rice and Arabidopsis. This gene promotes flowering 
under long days in Arabidopsis but it represses flowering under this conditions in rice 
(P. Suarez-Lopez, etal. (2001)). In addition, no clear homologues for FRIor FLCvjere 
found in the rice genome (S. A. Goff etal. (2002)). Although this may be expected 
based on lack of a vernalization requirement in rice, the absence of FRI and FLC 
homologues in the extensive wheat and barley EST collections («770,000) suggested 
the possibility that the temperate grasses used a different set of genes to develop their 
vernalization requirement. Since temperate cereals evolved from subtropical primitive 
grasses (W. D. Clayton, etal. (1986)) it is possible that the development of the 
vernalization pathway in the winter cereals evolved independently of the vernalization 
requirement in Arabidopsis. In this Example, we demonstrate the positional cloning and 
characterization of wheat H^/V^gene, and demonstrate that the genes included in the 
vernalization pathway in the temperate cereals are different from those in Arabidopsis. 

Positional cloning of wheat vernalization gene VRJ\/2 

In a previous study we mapped wheat vernalization gene VRN2 \r) the long arm 
of chromosome 5A using a segregating population from the cross between T. 
monococcumU\l92 {yrnlvrnl, spring) and G3116 {vrnlVrn2, winter) (J. Dubcovsky, et 
al. (1998)). We found strong epistatic Interactions between this gene and VRNl, 
indicating that both genes were part of the same regulatory pathway (G. E. Tranquilll, 
etal. (1999)). Similar epistatic interactions were found In barley (R. Takahashi, etal. 
(1971)) and both genes were mapped in colinear chromosome locations, suggesting 
that wheat and barley vernalization genes were orthologous (J. Dubcovsky, etal. 
(1998), D. A. Laurie, etal. (2002)). 
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In this Example, we developed a high-density map based on 2,849 unvernalized 
F2 plants from the DV92 x G3116 cross, as a first step for the positional cloning of 
VRN2. We used l//?/V2 flanking markers NUCELUNdx\6 UCW22 to determine the 
genotype of each of the F2 plants and to find 18 recombination events within this region 
(Fig. 13). Note: Probe Nucellin is a 832-bp region between exons 4 and 6 amplified by 
primers NucellinEx6L (CTTCACGAAGAGGTAGTTTTGAGG (SEQ ID NO: 72)) and 
NucellinEx4U (tgggtacaagcaggaggagc (seq id no: 73)) AR59084 (92-814- 
93,645) and probe UCw22 \s a 719-bp Sau3A I fragment (AF459088: 24,701 - 25,419) 
located between genes AF459088.2 and AF459088.3. The 2,831 plants that did not 
show recombination between the critical markers were used to confirm the location of 
W?/V2 within the NUCELUN-UCW22 vec;\ou. All plants homozygous for the DV92 allele 
showed a spring growth habit, whereas plants carrying G3116 alleles had a strong 
vernalization requirement when grown under long day conditions without vernalization. 
The simple Mendelian segregation of VRN2 in this mapping population facilitated the 
precise mapping of VRN2. We generated additional markers from the BAG clones 
Included in the physical map (Fig. 13) to define precisely the location of the two 
crossovers flanking the vernalization gene. The l//?/V2gene was finally mapped into a 
0.04-cM interval flanked by RFLP marker UCW22an6 PGR marker UCW2J (Fig. 13). 
Note: Probe UCW2J Is a 620-bp fragment with primers G171L 
(agtggcatcgttttcaggat (seq id NO: 64))andG171R 

(GCCATGCCGATAGCTGACTA (SEQ ID NO: 65) ) (AF459088: 338,499 - 339,118). 

The product from accession G3116 is digested into 505-bp and 115-bp fragments, 
whereas the DV92 PGR product is not digested. 

We constructed a complete physical map of the l//?/V2 region using the BAG 
library of T. monococcum accession DV92 (D. Lljavetzky, eta/. (1999)) and two steps of 
chromosome walking (Fig. 13). BAGs 258C22, 301G15, 405L8, and 455G17 were 
completely sequenced and annotated using procedures described before (P. SanMiguel, 
eta/. (2002)). Note: Probe UGW61 Is a 452-bp fragment amplified by primers V2L8F2 
CTGGATGGAAGACTTGGGATT (SEQ ID NO: 60)) and V2L8R2 
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(CTTCCTCGACCrCrCCACAG (SEQ ID NO: 61)) (AR59088: 191,044 - 191,496). It 
hybridized with a single copy fragment in genomic DNA of T. monococcum. UCW61 
was not included in the linkage map but was used to screen the T. monococcum BAC 
library and to select BAC 301G15 that closed the contig. The complete 438,828-bp 
annotated sequence was deposited in GeneBank under accession AR59088. We also 
sequenced the orthologous BAC 615K1 from barley variety Morex (AR59084) and BAC 
49F5 from rice Nipponbare (AF485811). 

Additional probes were as follows: A PCR marker for the second exon of the 
ZCCTl gene comprising a 231-bp fragment amplified with primers R3C1N3 
(gcaatcatgactattgacaca (SEQ ID NO: 62) ) and RACECINI 

(GGGCGAAGCTGGAGATGATG (SEQ ID NO: 63) ) (AF459088: 330,948 - 331,178). 

The PCR product from accession DV92 is digested by restriction enzyme Nco I into 189- 
bp and 42-bp fragments, whereas the G3116 PCR products is not digested. A SNF2P 
gene (encoding a global transcriptional regulator (L. Yan, etal. (2002)) probe 
comprising a 1,091-bp fragment between exon 14 and 15 with amplified with primers 

SNF2PExl4F (GGGTCATGGAGGAATGTTTG (SEQ ID NO: 66) ) and SNF2PExl5R 

(ttggcttctgcagagaggat (SEQ id NO: 67) ) (AF459088: 351,083 - 352,173). 
The PCR product from accession G3116 is digested by restriction enzyme EccR I Into 
900-bp and 200-bp fragments, whereas the DV92 PCR product is not digested. 
A SEC14 gene (AF459088.7 - encoding a protein similar to rice phosphatidylinositol / 
phosphatidylcholine transfer protein (AA020076.1) and Candida Glabrata SEC14 
cytosolic factor (CAA65985)) probe comprising a 305-bp fragment from the last exon of 
the SEC14 gene amplified with primers TmSEC14F (gttacgtgaactgtgacatc 

(SEQ ID NO: 68) ) and TmSEC14R (TCAGTTGCATGTCGACGAAGG (SEQ ID NO: 

69) ) (AF459088: 406,207 - 406,511). The resulting PCR product digested with 
restriction enzyme BatrM I produces a smaller fragment in DV92 than in G3116. A 
P450 gene (encoding a Cytochrome P450 protein) probe comprising a 376-bp fragment 
amplified with primers TmP450P3 (cgacgatgcccttccaaatg (seq id no: 70)) 
and TmP450P4 (tcaagcagctgctgcctccc (seq id no: 7i)) (AF459088: 
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432,795 - 433,170). The resulting PGR product digested with restriction enzyme Sad 
produces a larger fragment in DV92 than in G3116. 

Eight genes and one pseudogene were detected in the non-repetitive regions of 
the T. monococcum sequence, representing a gene density of one gene per 55-l<b and 
a ratio of genetic to physical distances of approximately 2.1-Mb per cM. Five of these 
genes where found in the same order and orientation in the barley BAG, and three in 
the rice BAG confirming the collinearity of these sequences (Fig. 13). The closest 
common genes flanking the VRN2qex\e, /'ZJSand SNF2P, were 7-kb apart in rice, 26-kb 
apart in barley, and 328-kb apart in T. monococcum (Fig. 13). 

The sequences from markers UCW22dir\A UCW2.1 flanking the VRN2qene in the 
genetic map (Fig. 13) were used to delimit a 315-kb candidate region within AF459088. 
Seventy-five percent of the sequence from this region was annotated as repetitive 
elements. We found only three genes completely linked to the l//?/V2gene. The first 
gene, designated AR59088.3, encoded a 254-amino acid protein that was 87% and 
96% similar to the orthologous proteins in rice and barley colinear BAGs respectively 
(Fig. 13). The AF459088.3 protein was 76% similar to Arabidopsis expressed protein 
AAD32834.1, which was close to SNF2P{\1-Vb)r as in the three grass species 
suggesting the conservation of a small colinear segment across the monocot-dicot 
divide. The AF459088.3 gene includes a conserved domain designated DUF614 
(pfam04749.2, E= 2e'^^) that Is present In different eukaryotic proteins of unknown 
function. 

We named the two other genes ZCCTl and ZCCT2 based on the presence of a 
putative zinc finger in the first exon and a CCT domain in the second exon. The CCT 
domain was named after CO, CO-like, and TOGl (J. Putterill, etal. (1995)), and is 
sufficient and necessary for the nuclear localization of CO In Arabidopsis (F. Robson, et 
al. (2001)). The proteins coded by the two other genes found in the l^/?/V2 region were 
76% identical, suggesting a duplication event that occurred approximately 14 + 3 
million years ago. Alignment of ZCCTl and ZCCT2 DNA sequences resulted in 629 
aligned base pairs. We found 22 transitions and 11 transversions in the 209 aligned 
base pairs at the third position. Using the average synonymous substitution rate of 6.5 



73 



X 10-9 substitutions/ synonymous.site/year calculated from the divergence of the adhl 
and aclh2qex\es in grasses (B. S. Gaut, etal. (1996)), we calculated that the duplication 
time of ZCCT gene in diploid wheat occurred approximately 13.9 ± 2.5 million years 
ago. 

A search of the Arabldopsis genome with the wheat ZCCT proteins showed that 
C(9and C^7^ike proteins were the most similar, but this similarity was restricted to the 
CCT domain (E= 2e'"). A similar search was performed in the rice genome, and CO 
related proteins AP005307 (Osl, E= Se^^) and AAL79780 (OsH, E= 2e"^^) showed the 
highest similarity values. The partial similarity of the ZCCT proteins to CO^ike proteins 
involved in the regulation of flowering time was the first indication of the potential of 
the ZCCT genes as candidates for VRN2. 

Evolutionary relationships between the ZCCT and CO-like genes 
Besides two ZCCT genes cloned from T. monococcum, we isolated additional 
ZCCT genes from the A genome of tetraploid wheat and from winter barley variety 
Diarokkaku, and compared their CCT domains with those from COIike genes in other 
plant species (Figure 18A). A recent study of the COIike gene family identified 17 
proteins in Arabldopsis, 16 in rice, and 9 in barley that were grouped in four major 
classes (I to IV) (S. Griffiths, etal. (2003)). We performed a Neighbor Joining cluster 
analysis using the 44-amino acids CCT motifs from the ZCCT proteins and from 
members of each of the different classes of COWVr proteins. CCT motifs from Group III 
{AtC0L9BX\(^ OsN) were very different and were used as an outgroup. The ZCCT 
proteins formed a separate group that was distantly related to members of Group IV 
proteins {,HvC09, Osl, OsH). These two groups were separated (bootstrap 73) from 
Groups I {AtCO, Hdl) and II {AtC0L6, Os]), which included genes with known effects 
on the regulation of flowering by photoperiod. No Arabldopsis protein was found within 
the ZCCT or Group IV clusters suggesting that this group of genes originated after the 
monocot-dicot divergence. In addition, no genes were found in the rice genome within 
the ZCCT cluster (Figure 18B). The absence of ZCCT genes in the colinear rice region 
flanked by genes AF459088.3 and SNF2P {Fig. 13) supports the results from the cluster 
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analysis suggesting tliat these genes were originated in the temperate cereals after the 
divergence with rice. 

Analysis of the putative zinc fingers confirmed the classification based on the 
CCr domains (Fig. 18B). CO-Wke proteins from groups II and I have one or two B-box 
zinc fingers with a conserved CX2CX15-16CX2C structure whereas the ZCCT proteins 
showed a single C2H2 zinc finger with a conserved CX2CX15-16HX3H structure. Zinc fingers 
from Group IV proteins were more similar, although not identical, to the zinc fingers 
from the ZCCT proteins. These observations suggest that the ZCCT genes probably 
originated from an ancestral CO-Wke protein from Group IV. However, their particular 
zinc finger motifs, differentiated CCT domains, and unique function justify the inclusion 
of these genes into a new gene group. 

Expression studies of the candidate genes 

Transcription levels of A/^59088.3\Nere not affected by vernalization (Fig. 14A), 
and no differences in expression were observed between spring and winter genotypes. 
Numerous Triticeae ESTs from dormant embryos, seedlings, roots, and young spikelets 
cDNA libraries showed significant similarity {E< e-100) to AF4S9088.3 suggesting a 
relatively high level of expression in numerous tissues. 

On the contrary, the absence of any ESTs corresponding to the ZCCT genes in 
the extensive wheat and barley collections (« 870,000 ESTs as of October 2003) 
suggested low transcription levels. We developed the following TaqMan systems for 
ZCCTl and ZCC72and used available /ICTZVand UBIQUnJN systems as endogenous 
controls (Example 1). TaqMan probes for ZCCTl and ZCCT2\Nere located in the 
junction between exon 1 and exon 2 to avoid genomic DNA amplification. The 
specificity of the two systems was confirmed by repeated experiments using as a 
substrate the cDNA clones from ZCCTl and ZCCT2. 

Test for amplification efficiency for TaqMan systems 

Tests for amplification efficiency were performed. Six 2-fold dilutions tested in 
triplicate; 1:1, 1:2, 1:4, 1:8, 1:16, 1:32. Standard curves were plotted with and slope 
and the differences between the slopes with the 185 standard curve were calculated. 
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Criteria for passed test: differences of slopes < 0.1. The efficiency based on tlie slope 
was also calculated. 

ZCCTl TaqMan System 

Left primer: CCAACACATGGCTCACCTAGTG (SEQ ID NO: 93) 

Probe (reverse orientation): AAATGGCACGATGTGGGCTCTTGCC (SEQ ID NO: 94) 

Right primer: TTGCTTCATTGCTAATAGTGTTGGT (SEQ ID NO: 95) 

Amplification efficiency - ZCCTl 

• Differences of the slopes with 18S: 0.027: ZCCTl system passed efficiency 
test 

• Efficiency 99.8 

ZCCT2 TaqMan System 

Left primer; CCACCACTGCAGATCATGGA (SEQ ID NO: 96) 

Probe (reverse orientation) : CCAAGAACCACCATCGTGCCATTCTG (SEQ ID NO: 97) 

Right primer: TTGCTAATAGTGCTGGTGAATGC (SEQ ID NO: 98) 

Amplification efficiency - ZCCT2 

• Differences of the slopes with IBS: 0.023: ZGCT? system passed efficiency 
test 

• Efficiency 99.6 

Time course expression of ZCCTl under long day and continuous liglit 
(See figure 17) 

Samples were extracted from leaves of unvernalized Triticum monococcum 
G3116 every 4 hours. The first 6 samples were extracted from plants located in the 
greenhouse under long day conditions. After the 2am sampling plants were transferred 
to a growth chamber under continuous light. Values are averages often plants ±SE. 
Units are linearized values using the 2^'^V method, where Ct is the threshold cycle. No 
significant differences In ZCCTl linearized values were detected among the different 
collection times under continuous light {P= 0.25) and highly significant differences were 
detected under the long day conditions {P< 0.0001). 

During the eight weeks of the vernalization experiment (16 h of light), we 
observed a progressive decrease of both ZCC7"transcrlpts In the leaves relative to 
ACTIN{F\q. 14B) or UBIQUTUN. Plants kept in the greenhouse showed stable transcript 
levels during all the experiment. An Important observation was that ZCCTtranscrlptlon 
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was not restored after removing the vernalized plants from the cold room and returning 
them to the greenhouse (Fig. 14, 2w out). A similar stable repression of transcription by 
vernalization was observed In the unrelated Arabidopsis MADS-box gene FLC, 
suggesting that the ZCCT genes may play a similar central role in the repression of 
flowering in cereals. 

The downregulation of ZCCTl during vernalization was paralleled by an Increase 
of API transcription (Fig. 14B). We have previously shown that API is the wheat VRNl 
gene (Example 1) and that there are strong epistatic interactions between VRNl and 
VRN2{G. E. Tranquilli, etal. (1999)). The model for these epistatic interactions 
predicted the opposite transcription profiles observed in Fig. 14. According to this model 
the VRN2q&c\e acts as a repressor of flowering, which directly or Indirectly represses 
/l/'i transcription (Example 1). 

Quantitative PGR analysis of the transcription of the ZCCT genes in the apices 
provided the first evidence that ZCCTl was a better candidate for l//?/V2than ZCCT2. 
ZCCTl transcripts were present in the apices from the unvernalized winter plants but 
after six weeks of vernalization were reduced to undetectable levels. API transcripts 
showed the opposite pattern, being greatly induced after vernalization (Fig. 14C). We. 
were not able to detect transcripts of ZCCT2 in the same RNA samples where we 
detected ZCCTl and API. Transcripts from ZCCT2v4&[q detected in positive controls 
from leaves. These results suggested that ZCCT2\Nas either not expressed in the apices 
or its transcription level was below our detection threshold. Since apices are the critical 
points for the transition between the vegetative and reproductive phases, these 
experiments suggested that ZCCTl was a better candidate for l/7?/V2than ZCCT2. 

An interesting observation was that the transcript level of ZCCTl and ZCCT2 
varied significantly during the day. However, no significant variation was observed 
when plants were transferred from the long day conditions (16 h light) to continuous 
light, suggesting that the circadian clock was not involved in the regulation of ZCCT 
transcription. We found that ZCCTtranscription was rapidly upregulated when plants 
were moved from the dark to the light (Fig. 14D), and downregulated when moved 
from the light to the dark (Rg. 14E). The role of this dual regulation of the VRN2 
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candidate gene by vernalization and liglit is still not clear, but it is tempting to speculate 
that this phenomenon might be related to the integration of photoperiod and 
vernalization signals in the regulation of flowering in temperate cereals. 

ZCCri transcription was downregulated during vernalization in both winter 
G3116 and spring DV92 plants, suggesting that the differences in growth habit were not 
originated by differences in the transcriptional regulation of ZCCTl. To test this 
hypothesis we compared the sequences of the promoter and coding regions from 
different l//?/V2 candidate genes between sprihg and winter accessions of cultivated T. 
monococcum from different parts of the world. We sequenced the complete coding 
region of ZCCTl from seven winter accessions, PI355522, PI277133, PI272561, 
PI573529, PI221413, PI355522, and G3116. None of the winter accessions carry the R 
to W mutation identified in DV92. 

The primers used to amplif/ cDNA of the ZCCTl gene were: 

C10ut3Fl: GGCTCCAATCGATCAATCAC (SEQ ID NO: 99) 
ClOutSRl: TTCTTCCTCGACGTCTCTCC (SEQ ID NO: 100) 

Allelic variation among cultivated diploid Triticum monococcum 

We observed no differences In the AF459088.3 protein between m72-spring 
accession DV92 and l^/772-winter accessions PI355532 and PI277133. Similarly, no 
differences were found in the predicted ZCC72 proteins between K/772-spring accession 
DV92 and l//772-winter accessions PI272561 and PI277133. The promoter region 
(1,098-bp) and the 3' region (736-bp) of the ZCC72gene from winter accession 
PI272561 were also Identical to DV92. These results suggested that the differences In 
vernalization requirement were not associated to differences in the coding sequences of 
these two genes or in the regulatory sequences of ZCCTl. 

No differences were found either for the promoter region of ZCCTl between 
DV92 and winter accession PI272561. We compared the 638-bp promoter region 
downstream from the start codon of the ZCCTl in spring accession DV92 with the same 
region in winter accessions G3116, PI272561, and PI573529. The promoter sequence of 
DV92 was identical to the sequence of winter accession PI272561 confirming that the 
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differences detected between \/rn2ar\6 vrn2 alleles was determined by differences in 
the ZCCTl protein rather than differences in its transcriptional regulation. ^ 
Primers used to amplify the promoter region: 

ClProFl: TGAGGCGCGGGCAGTTGTTG (SEQ ID NO: 101) 
ClProRl: GGTTAAGCTTGGGGGAGAAG (SEQ ID NO: 102) 

However, comparison of the ZCCTl coding region from DV92 with cultivated T. 
monococcum accessions with a winter growth habit provided good evidence that ZCCTl 
was the l//?/V2gene. The spring accession DV92 carried a point mutation at position 35 
of the CCr domain that replaced an Arg (R) amino acid by a Trp (W). This R amino 
acid was conserved in all the ZCCT proteins (Fig 29A) and in all the CO-/ike proteins 
from Arabidopsis, rice and barley (S. Griffiths, eta/. (2003)). A point mutation at the 
same position in the CCT domain from CO in Arabidopsis EMS mutant co-7 did not 
affect the nuclear localization of the CO protein but produced a severe effect on 
flowering time (F. Robson, eta/. (2001)). Kurup et al. (2000) suggested that the CCT 
domain might be involved in protein-protein Interactions, and therefore, a mutation 
within this domain can disrupt these interactions and the function of the involved 
proteins. The conservation of the 35-R amino acid In all the CCT domains, and the 
strong effect of its mutation on flowering time indicate that this amino acid is essential 
for the correct function of the CCT domain and that the point mutation observed In 
DV92-ZCCn provides a good explanation for its spring growth habit. 

The Arg/Trp mutation in DV92 determined a unique /\/co I restriction site, which 
was absent In the wild allele (the probe for this polymorphism is described above). This 
polymorphism was used to screen a germplasm collection of 65 accessions of cultivated 
T. monococcum from different parts of the world. The Arg/Trp mutation was absent In 
all 16 winter accessions, but present In 22 of the 49 spring accessions. Screening of the 
remaining 27 spring accessions by hybridization with ZCCTl showed that 17 accessions 
had a complete deletion of ZCCTl and ZCCT2. Seven of the remaining spring accessions 
showed a 1-bp deletion in the VRNl promoter that explained their spring growth habit 
(SEQ ID NO: 17). We have Initiated crosses between the last three spring accessions 
and tester lines DV92 and G3116 to determine the location of the gene responsible for 
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the spring growth habit in these lines. It is interesting to point out that these three 
accessions originated from the eastern border of the T. /77o/70cocc(y/77 distribution 
(Bulgaria, Romania, and Russia). 

We confirmed experimentally that the complete ZCCT deletion was allelic to the 
Mr/72 allele from DV92. The Fi hybrid between accession PI190915 carrying the 
complete deletion and winter accession G3116 had a winter growth habit whereas the 
cross with DV92 showed a spring growth habit. In addition, all the F2 plants from the 
cross PI190915 x DV92 had a spring growth habit. In summary, the described 
mutations at the ZCCTl and VRNl genes were sufficient to explain the spring growth 
habit of 92%, of the cultivated T. monococcum accessions analyzed in this study. 

This provides supporting evidence to the importance of API and ZCCTlgenes in 
the determination of growth habit in diploid wheat 

Allelic variation in bariey 

The absence of both ZCCT genes in the orthologous BAG from barley variety 
Morex (Fig. 13), suggested that this variety carries a recessive allele. Hybridization 
of barley genomic DNA with wheat ZCCTl clone (UCW39) showed no fragments in 
Morex, but three A&al-fragments In winter Hordeum spontaneum. These three RFLP 
fragments were completely linked to SNF2P\u 102 F2 plants from the cross between 
Morex and H. spontaneum, demonstrating that barley Morex has a recessive vrn-H2 
allele completely linked to the ZCCT deletion. 

We cloned and sequenced two ZCCT genes from winter barley Dairokkaku. A 
Neighbor Joining cluster analysis of the complete wheat and barley proteins showed 
that the two barley genes were more similar to each other than to wheat ZCCTl or 
ZCCT2 genes. This lack of correspondence between the wheat and barley genes was 
expected because the divergence time between wheat and barley (11-15 million years 
ago (W. Ramakrishna, etal. (2002))) was close to the time of the duplication of the 
ZCCT genes (11-16 mya). The two barley genes were designated ZCCT-Ha and ZCCT- 
Hb 

To study the distribution of the deletion of the ZCCT genes in barley and its 
association to the 17772 allele, we screened a collection of 85 barley varieties from 
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different parts of the world that were previously characterized genetically for their 
vernalization alleles (R. Takahashi, (1956)). Hybridization of Southern blots with DNAs 
from these varieties with the ZCCTl probe showed the presence of three Dra I 
fragments in the 23 winter varieties, and their absence in 61 i//77-//2-spring barley 
varieties (C.-L Chen (2002)). The vrn'H2 spring barley variety Tan' was the only 
exception, showing a single Dra I fragment when hybridized with ZCCTl. Sequencing of 
part of Fan ZCCT gene showed that it was identical to ZCCT-Hb from winter variety 
Dalrokkaku. In summary, a perfect association was observed in barley between the 
presence of ZCCT deletions and the i//77-/y2 allele. 

Validation of ZCCTl as VRN2 by RNAi transgenic wheats 
We transformed winter bread-wheat variety Jagger with an RNA interference 
(RNAi) construct including a 347-bp segment from T. monococcum ZCCTl gene. Three 
positive TO plants from three independent transformation events were identified by 
PGR. However, only one of the three TO transgenics flowered earlier (23 days) than the 
negative control. An RT-PCR experiment using primers for the transcribed PolyA region 
from the vector confirmed the expression of the RNAi transgene in the early flowering 
transgenic plant and its absence in the negative controls (Fig. 15A). 
RNA interference 

The RNAi construct was made in the binary vector pMCG161 (available on the 
Internet at www.chromdb.org/mcgl61.html). This vector contains a cassette designed 
for making inverted repeat transcripts of a gene, flanking a loop, which should 
efficiently produce a double stranded RNA. Expression of the transgene is driven by the 
35S promoter followed by the AdhMvon. 

We cloned a 361-bp segment from ZCCTl (90-bp to 436-bp, excluding the CCT 
domain and the Zinc finger) In sense orientation between restriction sites Ascl -AvrW 
and in antisense orientation between restriction sites Sgfl - Spe I. The engineered 
recombinant plasmid was co-transformed with UBI:BAR into immature embryos of 
Jagger, a hard red winter wheat, by microprojectile bombardment as described before 
(P. A. Okubara, etal. (2002)). Jagger is less responsive to tissue culture and more 
sensitive to bialaphos than Bobwhite, the cultivar used in previous work, and therefore 
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the following additions were made to the post-bombardment callus maintenance and 
regeneration media: 5 uM cupric sulfate and 0.1 mg/L benzyladenopurlne as suggested 
by Cho et al. (M. J. Cho, etal. (1998)). Selection of transformants was done by addition 
of 3 mg/L bialaphos to shoot regeneration and Img/L bialaphos to rooting media. 
Positive plants were confirmed by PGR using primers designed based on the vector 
sequence flanking the sense and antisense insertions. 

Ri_S_F: GTTGAGTGGCCCTGTTTCTC (SEQ ID NO: 103) 

Ri_S_R: CATTGATCAGCCTAACCAAACA (SEQ ID NO: 104) 
Expected product: 741 -bp 

Ri_AntiS_F: CAAATTCTAATCCCCAATCCAA (SEQ ID NO: 105) 

Ri_AntiS_R: GGCGGTAAGGATCTGAGCTA (SEQ ID NO: 106) 

Expected product: 632 -bp 

Transcription of the transgene In the three selected transgenic plants was 
confirmed by RT-PCR using primers for the transcribed Octopine Synthetase PolyA 
region of the pMCG161 vector. No amplification was detected in the control plants. 

OCS-PolyA_F: AGTGGGTCTAGAGTCCTGCTT (SEQ ID NO: 107) 
Ri_AntiS_R: GGCGGTAAGGATCTGAGCTA (SEQ ID NO: 108) 

Expected product: 124 -bp 

Transcription level of ZCCTl was tested using the ZCCTl TaqMan system. To 
avoid amplification from the transgene, the reverse transcription was performed with 
primer Race_C12Fl that is outside the 361-bp region included in the vector. As a 
control, ^CTZV primer Actin_L was also included in the reverse transcription reaction. 
Transcription level of API in the transgenic plants was evaluated using the API TaqMan 
system (See Example 1). 

Quantitative PGR experiments showed that only one of the early flowering 
transgenic plants showed a downregulation of ZCCTl and upregulation of API in the 
leaves (Fig. 15A). This plant flowered 23 days eariler than the negative control (Fig. 
15A). 

We self-pollinated the eariy flowering transgenic TO plant and determined the 
presence or absence of the transgene in 45 plants from the Tl progeny by Southern 
blots. Hybridization of genomic DNA with the 35S promoter from the vector showed a 
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single fragment that segregated In a perfect 3:1 ratio (34 plants present vs. 11 plants 
absent). All plants carrying the transgene flowered earlier (3-5 weeks) than the 11 
plants homozygous for the absence of the transgene. This experiment confirmed that 
the reduction of the RNA level of ZCCTl is directly associated with the acceleration of 
flowering time. 

A new vernalization pathway 

The complete linkage between ZCCTl and VRN2 in a large mapping population, 
its gradual and stable transcriptional downregulation during vernalization, its opposite 
transcription profile to API, the association between natural allelic variation at ZCCTl 
and spring growth habit in four independent mutation events, and the elimination of the 
requirement of vernalization by RNAI of ZCCTl transcripts, demonstrated that ZCCTl is 
the VRN2qe.r\e. 

Therefore, the central repressor of flowering in the vernalization pathway in 
temperate cereals is a gene that Is not present In Arabldopsls. The vernalization 
pathway In the temperate cereals also differs from that In Arabldopsls In the direct 
regulation of /^/'i transcription by vernalization (Example 1). Therefore, we conclude 
that the temperate grasses developed a vernalization pathway de novo, using a 
different set of genes than Arabldopsls. 

Arabldopsls has been recently compared to the Rosetta stone because of its 
huge contribution to our understanding of the "language of flowering" (G. G. Simpson, 
etal. (2002)). However, there are more human written languages than those included 
in the Rosetta stone. In a similar way, this Example shows that there might be also 
multiple "languages of flowering" that will require dedicated research efforts to be 
deciphered. This Is parKculariy Important for the crops that feed our worid. 
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